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ncient DNA has radically altered our under- 

standing of the evolutionary history of many 

plants and animals, as well as our own. It has 

revealed areas of our genomes that were in- 

herited from closely related archaic hominins, 

granted insight into past ecosystems, and un- 

veiled extinct species and populations from 

across the tree of life. Each sample of ancient 

DNA is a snapshot of a place and time, giving 

a brief glimpse into the past. A better un- 

derstanding of the DNA that was inherited 

by modern individuals—and, sometimes 

more importantly, the DNA that wasn’t— 

can inform us about many evolutionary 

processes, including adaptation, speciation, 
and domestication. 

As the field of ancient DNA matures, we 

now have the foundational understanding 
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ing ecological landscape, as well as our role in those 
changes. When human studies are done ethically and re- 
sponsibly, we can integrate with communities and redis- 
cover stories that have been lost or erased from history, 
thereby possibly reconnecting modern people to histori- 
cal relatives. Beyond these focused investigations, how- 
ever, the technical limits of ancient DNA remain unclear. 
Although methodologies for capturing ancient DNA have 
greatly improved, how long DNA can survive environ- 
mental damage in different climates and 
conditions is still an open question. 
Though we will never be able to get a full 
picture of the past, studies of ancient DNA 
continue to reveal previously unknown as- 
pects of history across organisms. This spe- 
cial issue examines the changing landscape 
of how ancient DNA is studied today, includ- 
ing previously untapped sources, improve- 


to appreciate the finer details granted by 
these snapshots. Studies of past plants and 


animals can help to illuminate the chang-  popcast 
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ments in technology, and ethical challenges, 
and what we've learned about ourselves 
through ancient DNA. 10.1126/science.adk9651 
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Herbaria are a growing 
resource for ancient 
DNA studies. The plant 
specimens shown 

here are Ambrosia 
artemisiifolia (this 
page), also known as 
common ragweed, and 
Solanum tuberosum 
(opposite page), also 
known as potato. 
Herbarium specimens 
contain not only 
biological materials but 
also notes from their 
original collection. 
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Going local with ancient DNA: A review of human 
histories from regional perspectives 


Maria C. Avila-Arcos'*}, Maanasa Raghavan’*+, Carina Schlebusch?*5*+ 


Ancient DNA (aDNA) has added a wealth of information about our species’ history, including insights 
on genetic origins, migrations and gene flow, genetic admixture, and health and disease. Much early work 
has focused on continental-level questions, leaving many regional questions, especially those relevant to 
the Global South, comparatively underexplored. A few success stories of aDNA studies from smaller 
laboratories involve more local aspects of human histories and health in the Americas, Africa, Asia, and 
Oceania. In this Review, we cover some of these contributions by synthesizing finer-scale questions of 
importance to the archaeogenetics field, as well as to Indigenous and Descendant communities. We 
further highlight the potential of aDNA to uncover past histories in regions where colonialism has 


neglected the oral histories of oppressed peoples. 


xponential increases in ancient human 

genomic data over the past 10 years has 

enabled the reconstruction of past migra- 

tions, admixture between human popula- 

tions and between anatomically modern 
human lineages and other hominins, natural 
selection, and infectious diseases [e.g., (J-3)]. 
Most of this wealth of information has been 
generated by a few groups of investigators 
whose primary focus, until recently, has been 
large-scale events such as the out-of and back- 
to Africa migrations and the initial peopling 
of continents, with an overrepresentation of 
European-focused studies. Since then, there 
have been calls to diversify paleogenomic re- 
search, to consider more local questions, and 
to go “beyond broad strokes” to sociocultural 
aspects such as kinship, marriage practices, 
and how polities and power were structured 
in the past (4). Although many big-picture mi- 
gration questions remain unanswered, partic- 
ularly in non-European contexts [see Boxes 1 to 
3 for a summary of existing broad-scale an- 
cient DNA (aDNA) research], there is growing 
literature focusing on site- and region-specific 
scales. It is noteworthy that in several broad- 
level studies, there are cases, at times deemed 
genetic “outliers,” that do not conform to the 
big migration narratives and/or regions that 
have a different demographic history com- 
pared with the reported continental-scale pat- 
tern. Such cases offer exciting and realistic 
snapshots of processes occurring at local scales. 
In this Review, we draw upon select paleo- 
genomic studies (Fig. 1) that illuminate local 
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histories in the Americas, Asia, Oceania, and 
Africa, and discuss the potential of the field to 
add fine-scale resolution to human demography, 
social structures, and health over time. 


Connecting ancient and modern peoples 


There is a recurrent interest in identifying 
genetic connections between ancient and 
present-day populations. Although there can 
be certain technical limitations in assessing 
population continuity with absolute certainty, 
including a bias in the availability of present- 
day reference populations, the identification 
of relationships at different levels of resolu- 
tion between ancient and modern peoples has 
generated rich knowledge in human paleo- 
genomics. For instance, it has allowed us to 
learn when the formation of the genetic struc- 
ture observed today was established, the char- 
acteristics of different past demographic events 
such as gene flow from other known or “ghost” 
populations, and to prove or contest hypothe- 
ses of population replacement. These studies 
have even allowed the identification of ances- 
tors and reaffirmed the connection to the land 
for Indigenous populations. 

Paleogenomic studies targeting African pop- 
ulations have assessed the existence of con- 
nections between African and Eurasian and 
Middle Eastern populations. For example, 
the use of aDNA has given insights about the 
origin of the genetic connection of present-day 
North Africans with Eurasian, Middle Eastern, 
and, to a lesser extent, sub-Saharan populations. 
The origin of this genetic makeup had been a 
subject of debate, with some attributing it to 
Paleolithic back-to-Africa migrations and others 
to migrations during the Neolithic period. Studies 
analyzing aDNA from 15,000-year-old remains 
in current-day Morocco have revealed substan- 
tial Eurasian genetic contributions that predate 
the Holocene and the emergence of farming 
practices (5). The Eurasian component of these 
Epipaleolithic individuals was related to Levan- 
tine Natufian populations, whereas a separate 
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study (6) showed genetic continuity betw Che 
Epipaleolithic hunter-gatherers and Early 1.-2— 
lithic farmers in Morocco, indicating a cultural 
transition rather than population replacement. 
Late Neolithic individuals exhibited an Iberian 
component (6). These findings suggest that 
the spread of farming practices into this re- 
gion of North Africa involved the movement 
of both ideas and people. 

In West Africa, an aDNA study on four indi- 
viduals between 8000 and 3000 years old from a 
rock shelter at Shum Laka, Cameroon (7), chal- 
lenged the assumption that the cave displayed 
the cultural development sequence of Bantu 
speakers from hunter-gatherers to more set- 
tled societies before the onset of the Bantu 
expansion. The genetic analysis revealed a sur- 
prising association of the ancient individuals 
with rainforest hunter-gatherers instead of 
Bantu-speaking groups. Although this unex- 
pected finding questions the in situ cultural se- 
quence of the Bantu-speaker culture at Shum 
Laka, it is still possible that Bantu-speaker an- 
cestors coexisted with rainforest hunter-gatherer 
ancestors who were buried in the caves. 

In Asia, the search for genetic connections 
between ancient individuals and present-day 
populations has yielded knowledge on the 
local human genetic histories of regions such 
as Korea and Tibet. A study by Gelabert et ai. 
(8) investigated the pre- and protohistory of 
Korea by shotgun sequencing eight ancient 
individuals from the Korean Three Kingdoms 
period (57 to 668 BCE) that precedes the Korea 
Kingdom and the present-day national iden- 
tity of Korea. The authors found that present- 
day Koreans are closely related to the Three 
Kingdoms period individuals and detected a 
past population structure that may be indica- 
tive of diverse groups coinhabiting the region. 
Similarly, paleogenomic studies of ancient 
individuals from the Tibetan Plateau and the 
Himalayan arc (9, 10) have shed light on early 
population structure and genetic diversifica- 
tion of plateau-specific lineages, and have also 
revealed ancient gene flow between this re- 
gion and East, South, and Central Asia, which 
has shaped its present-day genetic landscape. 

Similarly, paleogenomics projects performed 
between researchers in partnership with Indige- 
nous communities in North America have re- 
vealed the relationships between Ancestors 
and Indigenous descendants, as well as a more 
detailed account of the regional demographic 
history (7-13). By studying genome-wide data 
from ancient individuals from the Pacific 
Northwest Coast in Alaska, a region of interest 
because of its association to the coastal dis- 
persal hypothesis (see Box 3), Lindo et al. (11) 
identified an early ancestral genetic lineage 
seemingly stemming from a late Pleistocene 
coastal migration into the Americas. This 
ancestral lineage, discovered from the low-depth 
genome sequencing of an ~10,300-year-old 
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individual known as “Shuka Kéa” (“Man Ahead 
of Us”), was different from the ones previously 
identified from the late Pleistocene (/4) or 
early Holocene [Kennewick Man, referred to 
as “The Ancient One” by Native Americans 


(15)]. The authors also found that population 
structure existed among Native North American 
groups at least since the late Pleistocene. The 
same authors conducted a study of the genomes 
of eight present-day tribal members and 12 an- 


cient individuals from two archaeological 
sites on the Muwekma Ohlone tribe in the San 
Francisco Bay land spanning ~2000 years (13). 
The analyses of these present-day and ancient 
genomic data confirmed genetic continuity in the 
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Fig. 1. A non-Eurocentric view of human paleogenomic studies. (A) Map showing 
paleogenomic studies with a focus on non-European regional questions highlighted 
in this Review. Average geographic coordinates are shown for studies with samples from 
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several proximal locations. (B) The information that can be obtained from skeletal 
remains to reconstruct past histories can be divided into different categories. 
(C) Age range of the individuals sampled in the studies highlighted in this Review. 
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San Francisco Bay Area for the past 2000 years. 
These results stood in contrast to previous re- 
constructions based on archaeological and 
linguistic information, adding knowledge of 
Indigenous population history in California 
and western North America more broadly. 

Finding genetic connections between an- 
cient human remains and present-day popula- 
tions is of particular relevance to Indigenous 
communities, who bear strong links to their 
land and Ancestors and may seek repatriation 
of Ancestral remains. For example, a study from 
Australia sequenced 10 nuclear and 27 mito- 
chondrial genomes from precontact Aboriginal 
Australians and revealed past population struc- 
ture, including an east-west genetic gradient also 
seen today, as well as signals of regional genetic 
continuity (16). In the Americas, finding genetic 
continuity has been the form rather than the 
exception, as it has been detected in the Pacific 
Northwest (1, 12), California (73), Mexico (17), 
Central America (78), and Patagonia (19, 20) 
as early as 10,000 years ago and as late as 1000 
years ago. 


Sociocultural revelations 


A few recent works have used paleogenomics 
to investigate local aspects of population his- 
tories and societal structures at the community 
level, including past sociopolitical dynamics 
through investigations of burial traditions, kin- 
ship, sex, and social status. We highlight a few 
studies on this theme below, focusing on non- 
European contexts, and direct readers to 
Racimo e¢ al. for deeper coverage of European 
case studies on sociocultural aspects (4). 

Lee et al. (21) focused on the Xiongnu empire, 
which established the first “nomadic imperial” 
system in the Eastern Eurasian steppe. From 
archaeological and historical evidence, it is sug- 
gested that nomadic and sedentary groups 
converged into this region between 200 BCE 
and 100 CE, resulting in a multiethnic society. 
The authors investigated how this culturally 
characterized diversity, homan movements, 
and ethnopolitical organization at the local scale 
translated to genetic diversity. Through the anal- 
ysis of 18 ancient individuals from one aristo- 
cratic and one local elite cemetery, they found 
that both high- and low-status burials harbored 
high genetic diversity, likely representing the 
overall diversity of the Xiongnu empire. Lower- 
status individuals, often located in the periphery 
of the higher-status or elite burials, harbored 
higher genetic heterogeneity compared with 
the higher-status individuals, suggesting that the 
former were drawn from diverse parts of the 
empire. The authors also detected extended 
family groups and found that the highest-status 
individuals were primarily women, suggesting 
that the Xiongnu women likely had prominent 
sociopolitical roles. 

A similar study led by a Turkish team ex- 
plored how early sedentary Anatolian Neolithic 
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societies were organized (22). By inferring 
kinship and pedigrees for 59 Neolithic Ana- 
tolian individuals, including 22 ancient genomes 
generated from two important sites from this 
period, the authors observed a changing trend 
over time with respect to burial and kinship 
practices. In the 9th to 8th millennia BCE, 
related individuals tended to be buried to- 
gether in domestic structures, whereas in the 
7th millennium BCE, burial structures included 
biologically unrelated individuals, indicating a 
shift in the societal structure with respect to 
how familial relationships were regarded. 

In Africa, a recent study of the Swahili coast 
indicated asymmetric social interactions and 
sex-biased admixture between groups during 
back-to-Africa gene flow from various traders 
introducing different Eurasian components at 
different time periods (23). The study showed 
substantial genetic mixing between African 
and Asian populations, particularly those from 
Persia and India, between 1250 and 1800 CE, 
coinciding with the spread of Islam in the 
region. Subsequent interactions with Asian 
and African groups have further changed 
the ancestry of present-day Swahili coastal 
populations. 

In Australia, using uniparental markers from 
five Ancestral Aboriginal Australian remains 
from Cape York in northeast Queensland, a 
study found that individuals buried both in 
beach and rock shelter contexts shared close 
genetic affinities with present-day communities 
from the same region (24). By contrast, an indi- 
vidual with a more distinctive beach burial was 
genetically closer to present-day populations from 
another region, specifically central Queensland 
or New South Wales, thereby gaining more res- 
olution on past mortuary practices. 

As paleogenomics turns to more site-specific 
questions, aDNA in tandem with archaeology 
can aid in the reconstruction of past societal 
facets. Such studies can bring us closer to un- 
raveling complexities of human societies that 
are not always captured by larger narratives of 


migrations and admixture. However, we do need 
to be cognizant of ascribing identities to ancient 
individuals and societies that may be biased by 
present-day worldviews and practices. 


Subsistence and lifestyle 


The integration of paleogenomic studies, iso- 
topic analyses, and archaeological inferences 
has yielded substantial contributions to our 
understanding of the evolution of subsistence 
strategies. By providing valuable insights into 
the genetic characteristics of hunter-gatherers, 
agriculturists, pastoralists, and marine-based 
societies over time, these interdisciplinary ap- 
proaches have deepened our knowledge of 
how subsistence practices change over time. 

Genetic diversification of populations with 
different lifestyles and subsistence strategies 
is a particular recurrent theme in paleogenomic 
studies in the Americas (17, 19, 20). For example, 
the analysis of the genome of a 3000-year-old 
woman referred to as “Tat6ok yik s shaawat” 
found in southeast Alaska revealed a divergence 
time of at least 6000 years ago between coastal 
and inland populations (72). At the other end 
of the Americas, studies on the genomes of 
ancient Patagonians indicated an initial sepa- 
ration between maritime and terrestrial hunter- 
gatherer populations after the formation of the 
Magellan Strait ~8000 years ago, followed by 
diversification among marine hunter-gatherers. 
Apart from providing an approximate date for 
the appearance of marine hunter-gathering as 
a subsistence strategy, the study of these ancient 
Patagonian genomes also refined the timing of 
cultural transitions in the region (19, 20). 

In Asia, paleogenetic studies suggest re- 
gional heterogeneity in human expansions 
associated with the spread of agriculture and 
domestication. aDNA from Upper Mesopotamia, 
home to some of the earliest sedentary so- 
cieties leading up to the Neolithic Transition, 
suggests genetic connections between this 
region and Neolithic Anatolia that shaped the 
genetic signature of early Anatolian farmers 


a ey 
Box 1. Contributions of aDNA to the study of human evolutionary histories in Africa. 


The concentration of landmass on the equator creates hot and humid conditions in Africa, which has largely 
hindered the preservation of aDNA. This, coupled with financial constraints in developing countries, has 
delayed aDNA research in Africa and placed it in its early stages compared with other continents. Thus far, 
research in Africa has primarily focused on deep-time population structure (5, 7, 32, 48) and large-scale 
Holocene migration patterns connected to herding or farming practices and trading (1). aDNA studies using 
synchronous genetic analysis of modern and ancient African populations have shown that the earliest pop- 
ulation divergence among modern humans in Africa occurred between the Khoe-San groups of southern Africa and 
other African populations, whereas the hunter-gatherer groups of the central African rainforest diverged later 
(62, 48). A recent study on aDNA from Shum Laka (Cameroon) in West Africa proposed a radiation model involving 
lineages leading to Khoe-San hunter-gatherers, rainforest hunter-gatherers, East and West Africans, and a “ghost 
modern” lineage that no longer exists (7). A “ghost-archaic” source is also included in the model, and West 
African groups exhibit low levels of “ghost archaic” admixture, together with a larger amount of ghost- 
modern admixture. aDNA from anatomically premodern human remains might be crucial to resolving the 
question of archaic admixture in Africa, as it did for Neanderthal admixture outside of the continent. 
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(25, 26). Similarly, early millet-farming socie- 
ties in the Yellow River and West Liao River 
basins in northern China showed substantial 
shifts in their genetic profiles over the past six 
millennia that paralleled subsistence transi- 
tions (27). Specifically, intensification of millet 
farming in the West Liao River region was 
associated with an increased Yellow River ge- 
netic affinity. The role of human migrations in 
the introduction of dairy pastoralism in Central 
Asia has been investigated by several aDNA 
studies. They have, for example, found associa- 
tions between this subsistence and Afanasievo- 
related gene flow into Mongolia (28), but no 
such association was detected farther west in 
Xinjiang, where the earliest Tarim mummies 
likely represent a local agropastoralist popula- 
tion (29). In South Asia, genomic analysis of a 
single individual affiliated with the Bronze Age 
Indus Valley Civilization and 11 genetic outliers 
from Bronze sites in Iran and Turkmenistan can 
be modeled as a key ancestral source for many 
present-day South Asians (30). Lack of Anatolian 
farmer-related genetic ancestry in these ancient 
individuals could suggest local agricultural de- 
velopment in South Asia. 

Paleogenomic studies in Africa have been 
focused on the impact of agriculture on popu- 
lation distributions. For example, there have 
been studies of the Bantu expansion, which 
started ~4000 years ago and involved West 
African groups who spread throughout sub- 
Saharan Africa, replacing or intermixing with 
existing groups (37), or that of herder commu- 
nities from Northeast Africa, who also migrated 
southward, settling in East Africa and even 
reaching southern Africa (32, 33). These studies 
have used aDNA to deepen our understanding 
of population movements and the interac- 
tion between different groups during this trans- 


formative period. Several aDNA studies in Africa 
also provide comprehensive information on 
archaeology, social structure, subsistence prac- 
tices, and inferred diets [e.g., (7, 33)]. As an 
illustration, in a study focusing on individuals 
from central south Africa, isotope analysis indi- 
cated a low-trophic diet associated with farming, 
which contradicted the previous hypothesis of 
Stone Age hunter-gatherer affiliation (34). Sub- 
sequent aDNA studies confirmed their identifi- 
cation as Iron Age farmers genetically connected 
to present-day Bantu-speaking populations in 
the surrounding regions (37). 

Integrating paleogenomic studies, isotopic 
analyses, and archaeological inferences has 
expanded our knowledge of subsistence strate- 
gies and associated population dynamics, 
holding promising prospects for understand- 
ing the complexities of human history and cul- 
tural interactions in the future. Additionally, 
we acknowledge a rich body of literature on 
animal and plant domestication (35) and bone 
assemblages from archaeological middens 
(36, 37) that add to our understanding of past 
diets and subsistence. 


Disease and health 


aDNA studies have substantially advanced our 
understanding of local patterns in the disease 
and health of ancient populations (38). By ana- 
lyzing disease dynamics, transmission patterns, 
and genetic adaptations, these studies pro- 
vide valuable insights into the complex inter- 
play among lifestyles, past human migrations, 
and diseases in different regions (35). 

In Africa, for example, a paleopathology 
study of the skeletal remains of a 2000-year-old 
hunter-gatherer boy related to modern-day 
Southern African Khoe-San groups indicated 
poor health, leading to his death at a young 


| 
Box 2. Contributions of aDNA to the study of early population structure and movements 


in Asia and Oceania. 


Upper Paleolithic human genomes, primarily from East Asia and Siberia, have augmented our under- 


standing of the early diversification of modern human lineages out of Africa, their contributions to present- 
day genetic diversity, and introgression with Neanderthals and Denisovans (56). In the Holocene, the focus 
of most aDNA studies shifts toward broad-scale migrations across Asia and Oceania that are associated 
with subsistence and language expansions (1). For example, West Asia and the Near East have been central 


t 
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questions about the origins of early farmers and their subsequent expansions (1). A parallel exists in East 


Asia, another major domestication center, where paleogenomics tracks decreasing genetic differentiation 
between northern and southern East Asia after the Neolithic, reflecting gene flow between these regions, as 
well as genetic connections between coastal Neolithic groups from Siberia through Japan to Southeast 
Asia (56). Similarly, several paleogenomic studies associated the movement of groups with language 


expansions during the 


eolithic and Bronze Age in Eurasia. These include the origins and spread of Indo- 


European and Anatolian languages and Austronesian- and Austroasiatic-speaking groups migrating from East 
Asia to Island Southeast Asia and Oceania (1). Moreover, paleogenomics has traced long-range movements 
from East Asia to various regions of Oceania and the broader Pacific, some entailing multiple, diverse 
migration streams (1). Finally, aDNA has been used to investigate movements among Asia, Oceania, and 
the Americas. In northern East Asia, especially Siberia, studies have identified genetic lineages, such as 
the “Ancient North Siberians” and “Ancient Palaeo-Siberians,’ that are related to Native American founders, 
as well as later gene flow events between Siberia and North America (57). 
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age estimated at ~7 years (39). The shotgun 
metagenomics analysis revealed that the boy 
was infected with Rickettsia felis, a flea-borne 
pathogen that causes a typhus-like disease 
(40). The finding that R. felis existed at least 
2000 years ago among southern African Stone 
Age hunter-gatherer societies challenged pre- 
vious assumptions that R. felis was a new or 
emerging pathogen associated with sedentary 
lifestyles. 

In the Americas, the study of pathogen aDNA 
has been useful in understanding the effects of 
European colonization on disease prevalence 
[e.g., (4D] and the extent of pathogens that 
were present before contact in the Americas 
[e.g., (42, 43)]. For instance, aDNA extracted 
from precontact individuals from Peru with 
tuberculosis-associated lesions revealed that 
the strain responsible for these infections likely 
arrived in the Pacific Coast of South America 
through marine mammals (43). Similarly, com- 
parison of oral pathogen aDNA between the 
precontact and postcontact periods has shown 
a replacement of Indigenous strains by European 
strains, as well as a higher prevalence of these 
pathogens in postcontact times (42, 44). Several 
pathogens have been detected through aDNA 
in the Colonial era, including the infectious 
agents responsible for paratyphoid fever, syphi- 
lis, and hepatitis B (3, 41). 

aDNA has also provided insights into the 
incidence and transmission of pathogens in 
Asia. Identification of Late Neolithic to Early 
Bronze Age Yersinia pestis strains, causative 
agent of the plague, in Bronze Age genetic out- 
liers from the Cis-Baikal region is the first evi- 
dence for this lineage infecting individuals 
lacking Yamnaya-related steppe ancestry (45). 
Further reports of Y. pestis in Bronze Age sites 
in Yakutia and Cis-Baikal regions, representing 
the eastern-most Eurasian finding of this bacte- 
rium, may be indicative of broader spread of 
this disease in northeast Asia during the Bronze 
Age (46). aDNA studies have also highlighted 
the process of local adaptation to the extreme 
environmental conditions in the Tibetan Plateau 
and the Himalayas. Through the analysis of an- 
cient remains, valuable insights into the timing 
of selection for high-altitude adaptation has pro- 
vided a deeper understanding of how ancient 
populations successfully adapted to the chal- 
lenging conditions of these regions (9, 10). 

Overall, ongoing exploration of aDNA holds 
great potential for uncovering further details 
about ancestral health and genetic influences 
on our current health landscape. 


Community engagement in aDNA 


With rapid advances in paleogenomics research, 
engaging with Indigenous and Descendant com- 
munities is crucial to advancing ethical, trans- 
parent, and meaningful studies. Community 
engagement is critical in Africa, given the 
predominant health and medical focus of 
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population genetic studies in the region (47). 
As aresult, population history-focused research 
on modern-day groups and aDNA studies are 
mainly conducted by nonlocal research insti- 
tutions collaborating with local African institu- 
tions. Feedback to communities varies depending 
on previous engagement between them and 
collaborating institutions, with different teams 
adopting different approaches. Many aDNA 
projects in Africa were initiated through long- 
standing community-based archaeology projects 
spanning several decades [e.g., (23)] or estab- 
lished collaborations initially focused on modern- 
day population genetics research (48). Ethical 
concerns particularly relating to African aDNA 
research, the dignified treatment of human re- 
mains, and the protection of cultural heritage 
are important discussion topics and have been 
addressed in specific publications (49). 
Paleogenomicists engaging with Indigenous 
communities in the Americas have produced 
substantial research focused on regional in- 
quiries. One notable study resulted in the repa- 
triation and reburial of ancestral remains (77), 
and another involved genomic analysis of con- 
temporary tribal members and ancient individu- 
als after several years of community engagement 
and dialogue with the Muwekma Ohlone Tribe 
from the San Francisco Bay (13). These mutually 
beneficial collaborative efforts between research- 
ers and Indigenous communities yield valuable 
insights into local demographic events at a finer 
scale (50). Similarly, studies such as (6) that used 
aDNA techniques to facilitate repatriation of 
Ancestral remains with limited cultural contexts 
to Indigenous communities in Australia provide 
the impetus for similar work in other regions with 
acolonial past where families and Indigenous 
communities await the return of their Ancestors. 


Challenges 


With increasing focus on regional questions 
that tend to be inherently and closely inter- 
twined with histories of present-day popula- 
tions, especially Indigenous and Descendant 
communities, there is a need for tighter inte- 
gration of genetics, archaeology, and com- 
munity partnerships. Until this is achieved 
satisfactorily, current paleogenomic practices 
risk perpetuating imbalanced narratives with- 
out representation from all stakeholders, as 
well as widening the gap in scientific capacity 
between investigators in the Global North and 
the Global South. 

Studies with a regional focus involving local 
researchers tend to more accurately integrate 
the voices of the local communities [e.g., (1/-13)]. 
This is particularly important when Descendant 
communities belong to an Indigenous or mar- 
ginalized community. There is a growing body 
of work exposing the discontent of Indigenous 
peoples, especially from the Global North, for 
being underrepresented and not considered in 
paleogenomic studies in which “samples” from 
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Box 3. Contributions of aDNA to the study 
of initial peopling and early dispersal of 
humans into the Americas. 


The study of ancient genomes has contributed 
substantially to our understanding of how the 
Americas were initially peopled and how the 
first peoples dispersed throughout [reviewed 
in (57). The initial dispersal of peoples from 
northeast Asia into Beringia ~20,000 years ago 
was followed by an isolation period lasting a 
few millenia known as the Beringian Standstill. 
Humans then migrated south, likely through a 
coastal route ~17,000 years ago, followed by a 
rapid dispersal and branching into different pop- 
ations across the continent. The first major 
anching separating Northern North Americans 
NNAs) and Southern North Americans (SNAs) 
curred ~15,000 to 16,000 years ago, with ubiq- 
itous SNA ancestry detected in ancient and 
resent-day populations from present-day US to 
atagonia (57). South America was rapidly peo- 
ed by early SNA populations, followed by a 
continent-wide replacement by later SNA lineages 
at least 9000 years ago (57). The rapid disper- 
sal of populations after the NNA-SNA split at dif- 
ferent latitudes with contrasting environments 
implies that numerous adaptive and microevo- 
lutionary processes had to take place, resulting 
in local differentiation of human populations in 
the Holocene (1). 
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their Ancestors are analyzed (50). As recognized 
by Bader et al., this lack of engagement “emerges 
from the legacy of colonizing research in each of 
the fields [i.e. archaeology, biological anthropol- 
ogy, and genomics] from which paleogenomics 
draws” (51). Ultimately, the priorities and agen- 
das of researchers, institutions, and grant and 
publication structures should not supersede 
those of communities. Meaningful engagement 
requires dedicated time and funds to develop 
and sustain partnerships with local commu- 
nities (57). Nuances in community engagement 
exist, particularly in colonized contexts where 
oppression, murder, and forced relocations may 
make it less straightforward to identify De- 
scendant communities with whom to engage. 

Although the field of paleogenomics has 
grown by leaps and bounds since the publica- 
tion of the first ancient human and Neander- 
thal genomes, most of the success of this field 
has been limited, scientifically and in terms of 
financial and global recognition, to groups 
from the Global North who lead these studies. 
This creates a massive imbalance in who de- 
signs and executes paleogenomics research, 
who decides priority questions in the field, 
and who ultimately benefits from the research. 
Building local capacity, both within scientific 
collaborations and Indigenous and Descendant 
community partnerships, has been suggested 
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by both Indigenous researchers in the US (57) 
and researchers working in non-US contexts 
(52). It is generally acknowledged that efforts 
to be more inclusive, equitable, and engaged 
with local partners involve going beyond of- 
fering authorships. Researchers should con- 
sider avenues such as training and cowriting 
grants with local scientific collaborators, en- 
gaging regularly, asking for input on research 
questions, and returning results to Indigenous 
communities. 

As local aDNA research expands, the need 
for larger sample sizes presents a substantial 
challenge within the field. It is crucial to em- 
phasize the respect and preservation of human 
remains during the generation of aDNA data. 
Given the inherently destructive nature of 
generating aDNA data, it is important to con- 
sider the sequencing procedure for the sampled 
material, which is consumed during analysis. 
The two primary methods for extracting genetic 
information from ancient remains are capture 
sequencing and whole-genome shotgun se- 
quencing. The widely used 1240K capture panel, 
initially introduced in (53) and accounting for 
~70% of published ancient genomewide data 
as of 2023, has limitations in capturing rare 
variations in regions of low genetic diversity. 
Less biased capture panels that incorporate 
outgroup ascertainment schemes are now com- 
mercially available (54). However, the poten- 
tial for limited overlap between these panels, 
along with the persistence of residual bias and 
batch effects, may hinder future comparative 
studies (55). 

Although capture sequencing has long been 
considered a cost-effective option, the declin- 
ing costs of whole-genome sequencing and the 
expenses associated with commercial capture 
panels have substantially narrowed the cost 
advantage. Low-coverage whole-genome se- 
quencing of reasonably preserved human re- 
mains is comparable in price to commercial 
capture panels. Nevertheless, the key advantage 
of whole-genome sequencing remains its ability 
to provide comprehensive and unbiased genetic 
information, unlike the inherent bias introduced 
by capture panels. Whole-genome sequencing 
allows for the analysis of an individual’s com- 
plete genetic makeup, enabling direct population 
genetic investigations such as estimating pop- 
ulation split times, assessing genetic diversity, 
and tracking changes in effective population 
size over time. 

It is also worth noting that capture sequenc- 
ing does not amplify metagenomes or enable 
the identification of human-associated micro- 
organisms, including potential pathogens. To 
ensure the inclusion of metagenomes in analy- 
ses, it is important to encourage research groups 
to publish their complete raw metagenomic 
datasets alongside human genome-aligned 
binary alignment map (BAM) files. This call 
for standardized publication practices of 
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metagenomic data aims to enhance the com- 
prehensiveness of aDNA studies and facilitate 
thorough evaluation by the scientific community. 


Future directions and conclusion 


Paleogenomics research, although a relatively 
new field, continues to captivate the attention 
of scientists and the general public because of 
its groundbreaking contributions to our under- 
standing of the extensive population history of 
our species, spanning both ancient and more 
recent times. Although many studies have pri- 
marily focused on Europe and have addressed 
broader questions such as the out-of-Africa 
migration and the peopling of the Americas, 
this Review shifts its attention to regional in- 
quiries outside of Europe that have yielded 
substantial knowledge and impact. 

These highlighted studies conducted in the 
Americas, Asia, Oceania, and Africa share com- 
mon themes that have greatly benefited from 
the use of aDNA. These themes include fine- 
scale demography and social structure, shedding 
light on aspects such as short-range migrations, 
regional genetic continuity, the influence of sub- 
sistence strategies on genetic diversification and 
time since divergence, as well as genetic admix- 
ture at local and community levels. Additionally, 
investigations into kinship and family structures 
through the study of pedigrees have contrib- 
uted to our understanding of human societies 
in the past. Furthermore, the analysis of path- 
ogen aDNA has exhibited excellent potential 
in revealing population movements and social 
interactions on a regional scale. 

With recent advancements in wet lab tech- 
niques, DNA sequencing, and computational 
methods, the body of literature on aDNA is 
poised to expand substantially in the coming 
years. The exploration of both large-scale and 
regional questions, including those outlined 
in this Review, promises exciting frontiers 
that will further enhance our knowledge of 
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deep time and recent human evolution. It is 
crucial that the field progresses with a strong 
foundation built upon ethical, engaged, and 
responsible research practices. These princi- 
ples will serve as a cornerstone for the advance- 
ment of paleogenomics. 
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SPECIAL SECTION 


ANCIENT DNA 


REVIEW 


Ancient DNA genomics and the renaissance 


of herbaria 


Hernan A. Burbano’*+ and Rafal M. Gutaker?+ 


Herbaria are undergoing a renaissance as valuable sources of genomic data for exploring plant evolution, 
ecology, and diversity. Ancient DNA retrieved from herbarium specimens can provide unprecedented 
glimpses into past plant communities, their interactions with biotic and abiotic factors, and the 
genetic changes that have occurred over time. Here, we highlight recent advances in the field of 
herbarium genomics and discuss the challenges and opportunities of combining data from modern and 
time-stamped historical specimens. We also describe how integrating herbarium genomics data with 
other data types can yield substantial insights into the evolutionary and ecological processes that shape 
plant communities. Herbarium genomic analysis is a tool for understanding plant life and informing 
conservation efforts in the face of dire environmental challenges. 


erbarium specimens—meticulously col- 

lected and pressed plant samples—preserve 

a tangible record of botanical diversity 

and have long served as a foundation 

for botanical, taxonomical, and system- 
atics studies (7). Hosted across 3000 herbaria, 
these resources encompass close to 390 mil- 
lion specimens and their associated metadata 
(2). We owe this number to centuries of speci- 
men custodianship and current curation that 
ensures continuous growth, preservation, and 
sustainable use of this rich collection. Along- 
side their uses for botanical monographs and 
systematics (3), herbarium specimens covering 
diverse taxa and all continents increase the 
power of macroevolutionary studies, enabling 
investigations into trait evolution (4) and plant 
family radiations (5). 

Recent advances in high-throughput sequenc- 
ing technologies have facilitated the retrieval 
of genome-scale data from herbarium speci- 
mens (6). Herbarium-derived DNA can be 
classified as ancient DNA (aDNA) by its bio- 
chemical characteristics; it is highly fragmented, 
with distinct patterns of DNA misincorporations 
and breakage (7, 8). Such patterns result from 
postmortem DNA damage, which is accelera- 
ted during specimen preparation (9). DNA 
damage does not reflect the evolutionary his- 
tory of specimens, but it allows authentication of 
the historical nature of the herbarium-derived 
DNA (7, 10). Additionally, herbarium-derived 
DNA originates not only from the plant host 
but also its associated commensal or patho- 
genic microbiomes. Distinguishing between 
the genuine microbiome and postmortem col- 
onizers presents a methodological challenge 
(1D), but, if resolved, herbarium metagenomics 
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holds the potential to provide valuable in- 
sights into past biotic interactions. 

Lack of cell and genome integrity makes func- 
tional and structural genomic tools such as 
chromatin conformation capture unsuitable 
for historical tissues, and the fragmented nature 
of DNA derived from herbarium specimens limits 
de novo assembly of full herbarium genomes 
with comparable quality to modern genomes. 
However, short-read sequencing approaches 
have been developed to generate genome- 
scale data from herbarium specimens (12, 13). 
Although whole-genome sequencing allows 


“Herbarium collections 
offer valuable 
spatiotemporal data for 
investigating the genetic 
mechanisms underlying 


plant responses... 


genome-wide scans for selection to address ques- 
tions related to the genetic basis of adaptation, 
it can be substituted by reduced-representation 
techniques for addressing population history 
questions. By contrast, cost-efficient techniques 
such as enrichment capture of conserved genes 
and plastome sequencing are informative for 
macroevolutionary and systematics studies. 
Amplicon sequencing approaches have been 
reviewed exhaustively elsewhere (/4) and are 
outside the scope of this article. 

Despite the shortcomings associated with 
the degraded nature of herbarium-derived DNA, 
genomic studies of herbarium specimens offer 
a wide taxonomic and geographic sampling 
breadth to directly investigate the evolutionary 
responses of plants and their associated micro- 
biomes over the past 300 years of anthropo- 
genic global environmental change. 
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Contributions of herbarium genomics to or 


taxonomic and evolutionary studies ne 
Classification and macroevolutionary inference 
The integration of sequencing data with mor- 
phological data has substantially increased 
our ability to group plants into taxonomic 
units. In botany, species definition comes from 
type specimens, which were used to describe 
the characteristics of species in the first place. 
Once the genetic composition of type specimens 
is established, a newly collected plant can be 
identified to the species level using molecular 
characterization alone. A widely used approach 
involves coupling genome skimming with chloro- 
plast assembly or molecular barcode readout, 
allowing identification at the genus and species 
levels with high congruence with morpholog- 
ical identification (75). Such an approach was 
used to generate plastid and mitochondrial 
genomes from type specimens of red algae spe- 
cies (Pyropia) and allows molecular identification 
of unclassified material (6). Another challenge 
is to group species into higher taxonomic ranks, 
which often requires both morphological and 
genetic characterization. For example, genome- 
wide data helped resolve the challenging sweet 
potato genus ([pomea) (17). 

Herbarium collections harbor a wide range 
and diversity of species as well as geographies, 
which would not otherwise be accessible within 
the time frame of contemporary research 
projects. This is particularly useful for large- 
scale phylogenomic projects such as the Plant 
and Fungal Trees of Life, in which >28% of the 
taxa examined were sampled from herbarium 
specimens and were assessed using an enrich- 
ment capture approach (J8). The resulting 
phylogenetic trees inform our understanding 
of macroevolution and the history of plant life 
on Earth. For example, this approach contrib- 
uted to resolving the relationships of taxa in 
the mulberry family (Moraceae), and it indi- 
cated repeated, parallel evolution of inflexed 
stamens that facilitate wind pollination (19). 
Other uses include facilitating predictions of 
plant-based compounds, for example, the pre- 
diction of active compounds for medicinal use, 
a task in which herbarium collections play a 
special role (20). 

Preserving a permanent record of the source 
materials from which DNA was retrieved is 
also important. Consequently, large biogenome 
projects, such as the Darwin Tree of Life, which 
aim to build reference genome assemblies for 
vast quantities of species, have partnered with 
institutions that generate voucher herbarium 
specimens for the plant taxa sequenced (27). 


Tracing the evolutionary history of wild plants 


Herbarium collections offer valuable spatiotem- 
poral data for investigating the genetic mech- 
anisms underlying plant responses to global 
environmental change, including the influence 
of modern agricultural practices (22, 23). With 
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Fig. 1. The sampling range of herbarium specimens covers the time of intensified human 
influence on the environment. Sequencing the genomes of historical plants, as well as associated 
species, can address questions about their evolution and adaptation in response to anthropogenic 
pressures. Vertical bars indicate the intensity of human-mediated effects in the past five centuries 
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escalating global movement and trade, plants 
have become increasingly transported over long 
distances, heightening the likelihood of intro- 
duction of non-native species that might evolve 
into invasive species (24). Herbarium speci- 
mens have traditionally been used as proxies 
for documenting first-plant colonizers (Fig. 1). 
Sequences from herbarium specimens over 
dense time series can test hypotheses in inva- 
sion biology and determine whether changes 
in phenotypic traits are genetically encoded. 
Genetic analysis of herbarium specimens of 
garlic mustard (Alliaria petiolata) spanning 
50 years of North American invasion suggested 
that differential secretion of phytochemicals 
across this time period had an adaptive genetic 
basis (25). Herbarium genomics also recon- 
structed the North American population his- 
tory of a prevalent colonizing semiclonal lineage 
of thale cress (Arabidopsis thaliana) (26). A 
phylogenetic approach determined the time of 
colonization and connected the maintenance 
of de novo mutations in this lineage to root traits 
and flowering time that are likely important 
for the ecology of invasions. Human-induced 
alterations of landscapes has led to genetic mix- 
ing in the native range of common ragweed 
(Ambrosia artemisitfolia) in North America, 
before its introduction to Europe, potentially 
playing a role in its subsequent success as an 
invasive species (27). Furthermore, sequencing 
of numerous present-day and historical herba- 
rium genomes of ragweed revealed that Euro- 
pean populations exhibit a considerable turnover 
in genetic structure. This pattern is likely attrib- 
utable to multiple introductions, founder effects, 
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left) and crop (right) plants. Stacks of herbarium 


and genetic admixture with closely related 
European species (28). 

Anthropogenic activities additionally gener- 
ate habitats for wild plants. Agricultural fields 
face challenges from competition by weed spe- 
cies, requiring heavy use of herbicides to mini- 
mize interspecies competition and maintain 
crop yields. Excessive use of herbicides since 
their introduction in the 1950s has resulted in 
continuous evolution of weeds with resistance 
to these formerly deadly chemicals (Fig. 1) 
(29). Devising effective strategies for managing 
herbicide resistance necessitates determining 
whether resistance mutations originate de novo 
or were present in the existing genetic variation 
and to what extent gene flow plays a role in 
spreading herbicide resistance. Genotyping 
herbarium specimens of blackgrass (Alopecurus 
myosuroides) revealed that an herbicide re- 
sistance mutation located in the direct target 
of herbicides (target-site resistance) was segre- 
gating at a very low frequency approximately 
one century before the introduction of herbi- 
cides (30). A recent genomics study using con- 
temporary samples also suggested that standing 
genetic variation plays a crucial role in facilitat- 
ing the rapid evolution of target-site herbi- 
cide resistance (37). In addition to developing 
herbicide resistance, weeds must also adapt to 
the human-made environment of modern ag- 
ricultural fields. A recent study of waterhemp 
(Amaranthus tuberculatus), an agricultural 
weed native to North America, used geograph- 
ical pairing of hundreds of present-day genomes 
from natural and agricultural populations and 
atime series of hundreds of corresponding 
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herbarium genomes (32). These data allowed 
monitoring of changes in allele frequencies 
over time as well as estimation of selection co- 
efficients of variants observed predominantly 
in agricultural settings. 


A time machine for recent crop evolution 


As with wild plants, we can use time-stamped 
herbarium specimens of crop plants to address 
many important questions about their move- 
ment, adaptation, and changes in genetic diver- 
sity. Crop herbaria span a period of >500 years, 
with preliminary evidence showing that the 
oldest DNA sequences retrieved, from a tomato 
specimen (Solanum lycopersicum), date to 
~1558 (33). This time interval overlaps with 
the Columbian “exchange” of crops globally, a 
serendipitous test for the limits of environ- 
mental adaptation (Fig. 1). This period saw en 
masse movement of European staples to col- 
onies in the Americas, the introduction of 
American crops to Eurasia, and the global 
movement of cash crops for colonial exploi- 
tation. All continents were now connected 
by maritime routes, which resulted in rapid 
introduction of new crops, livestock, and dis- 
eases, with unprecedented effect on people’s 
well-being (e.g., illness) and the environment 
(e.g., agricultural pressures) (34). Herbarium 
specimens helped settle a long debate about the 
origins and adaptation of the potato (Solanum 
tuberosum) when introduced from the Americas 
to Eurasia. Sequencing of herbarium specimens 
revealed the genetic turnover of introduced 
varieties over time (35) and showcased the 
expanding role of hybridization, with geo- 
graphically distinct species that came into 
direct contact on the European continent (36). 
Putative preadaptation through phytohormo- 
nal regulation preceded an important change 
in day length-mediated tuberization that 
allowed potatoes to be cultivated in the com- 
pletely different European photoperiod re- 
gime (36). 

The past 250 years have seen increases in 
agricultural productivity. From the British Agri- 
cultural Revolution in the late 1700s to the 
Green Revolution in Latin America and South 
and Southeast Asia in the mid-1900s, new 
agrarian practices were accompanied by the 
adoption of new crop varieties (Fig. 1). Despite 
the general assumption that these processes 
had detrimental effects on the genetic diver- 
sity of crops, sequencing of seed bank mate- 
rials has provided little evidence supporting 
massive genetic diversity loss (37). Genome 
sequencing of time-stamped crops from herbaria 
will yield a direct measure of genetic diver- 
sity change, together with other demographic 
parameters. 

The postulated narrow gene pool available 
for crop breeding and the demand for genetic 
variants conferring stress tolerance, as well as 
recent developments in the field of plant 
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“neodomestication,” have sparked interest in 
the genetic diversity of crop wild relatives (38). 
In addition to the detrimental effects of land 
use change and habitat fragmentation on gene- 
tic diversity, reports suggest that wild relatives 
of major crops are heavily influenced by gene 
flow from cultivated counterparts, even in self- 
pollinating species such as rice (Oryza sativa) 
(39). The possibility of gene flow raises concerns 
regarding genetically modified organisms, from 
which, for example, herbicide resistance var- 
iants could be passed to wild and weedy species. 
Comparative genomics of time-stamped her- 
barium specimens of crop wild relatives will 
provide a direct measure of crop-to-wild gene 
flow risks over centuries. 


Herbarium specimens as time capsules of 
ecological interactions 


Shotgun sequencing of herbarium specimens 
enables characterization of not just plant ge- 
nomes but also genomes belonging to asso- 
ciated organisms. These include plant pathogens 
and the diverse microbial communities residing 
in the rhizosphere and phyllosphere, habitats 
for microorganisms on a plant’s aerial surface 
and at the root-soil interface, respectively. Spo- 
radically, other organisms such as insects, 
mollusks, and even small reptiles can be found 
in leaf creases. Therefore, herbarium speci- 
mens serve as windows into past biotic inter- 
actions, revealing coevolutionary trajectories 


between plants and their associated organisms 
(Fig. 2A). 

This field initially focused on detecting fila- 
mentous crop pathogens, such as fungi and 
water molds (oomycetes), stemming from the 
availability of infected plant herbarium speci- 
mens. These were collected by early plant pa- 
thologists and are curated alongside micro and 
macro fungi in specialized collections known 
as fungaria. The Irish potato famine pathogen 
Phytophthora infestans was detected by poly- 
merase chain reaction amplification of a short 
fragment of its mitochondrial DNA genome 
(40). Sequencing whole genomes from multi- 
ple historical samples considerably expanded 
the study of late blight, with a pandemic clonal 
lineage of P. infestans identified as the caus- 
ative agent triggering the Irish potato famine 
in the mid-1800s (41, 42). The strain from the 
19th century was completely replaced by other 
strains in the 20th century, although the 19th- 
century mitochondrial DNA is still in circula- 
tion (43). Combined analysis of historical and 
modern samples ascertained that polyploidi- 
zation distinguishes 20th-century samples 
from diploid 19th-century samples (47). More- 
over, changes in allele frequencies of pathogen 
genes modulating infection that are recog- 
nized by the plant immune system—effector 
genes—were identified after the introduction 
of resistant potato cultivars in the 20th cen- 
tury (41). 


Bacteria can also be accurately identified 
through analysis of herbarium-derived shot- 
gun reads. A proof-of-principle study using a 
library preparation method that specifically 
enriches aDNA molecules successfully assem- 
bled de novo genomes for commensal and 
pathogenic bacteria belonging to the genus 
Pseudomonas from a 19th-century potato 
herbarium specimen (44). Herbarium-derived 
genomes of Xanthomonas citri pv. citri (Xci), 
the causal agent of Asiatic citrus canker, have 
been used to reconstruct the population his- 
tory of this bacterial pathogen at both local 
(45) and global scales (46), revealing that the 
spread of Asian citrus canker disease is linked 
to climatic changes and the onset and global 
expansion of citriculture (46). 

Whereas agricultural disease outbreaks are 
often characterized by expansion of individual 
pathogenic lineages, wild plants are infected 
by genetically diverse groups of pathogens (47). 
Preliminary data annotating bacteriophages in 
present-day and herbarium-derived Pseudomonas 
genomes identified a putative bacteriophage- 
derived bacteriocin (tailocin) used by pathogenic 
bacteria for intralineage competition, which 
likely prevents expansion of single bacterial 
lineages (48). This study showcases the resolu- 
tion attainable through analysis of herbarium- 
derived bacterial genomes: determining the 
genomic location of the tailocins within 
Pseudomonas genomes, characterizing the 


Proteins 


Fig. 2. Environmental, molecular, and phenotypic wealth of herbarium specimens. 
(A) Herbarium sheets preserve not only the genomes of individual plants but also 
those of many associated species: microbes, such as bacteria and fungi, as well as 
macroorganisms, such as insects. (B) Herbarium specimens also harbor a plethora 
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of other types of data. Phenology and, occasionally, environmental or ethnobotanical 
information is recorded on the label. Anatomical features allow direct morphometric 
measurements. Other molecular data are available, from simple micronutrients 

and atomic isotopes to complex molecules such as alkaloids, RNAs, and proteins. 
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tailocin gene content, assessing conserved co- 
linearity of tailocin genes, and elucidating the 
haplotypes of highly diverse proteins likely 
crucial for tailocin bactericidal activity (48). 
Outside of pathogenic microbes, a targeted 
genomics approach was used to sequence and 
assemble symbiotic bacteria of wild yam 
(Dioscorea sansibarensis), revealing the hori- 
zontal transmissions of symbionts (49). 

Studies of fungi, oomycetes, and bacteria 
have focused on analysis of pathogen-infected 
herbarium tissue or highly abundant micro- 
bial taxa expected to be found on plant tissues 
according to reference microbiomes retrieved 
from fresh plant samples. Pure metagenomic 
assignment of herbarium-derived sequencing 
reads is complicated by short read length and 
nucleotide misincorporation, which decrease 
the reliability of read mapping. Once reads are 
reliably assigned to taxa, subsequent challenges 
lie in distinguishing between the authentic 
microbiome and postmortem colonizers. Inves- 
tigating whether patterns of DNA misincor- 
poration in reads mapped to specific microbial 
genomes resemble those commonly observed 
in aDNA (50) or using library preparation 
methods that selectively incorporate aDNA 
reads (44) is insufficient to distinguish the au- 
thentic microbiome. It is crucial to assess the 
significance of identified taxa by comparing 
them to reference microbiomes from fresh plant 
tissues and herbarium contaminants. A promis- 
ing alternative involves characterizing microbial 
taxa preferentially found in herbarium speci- 
mens across different plant taxa and multiple 
herbaria locations (17). This approach makes it 
possible to differentiate the authentic meta- 
genome from potential contaminants within the 
herbarium metagenome. Shotgun metageno- 
mics was used on present-day and herbarium 
genomes of A. artemisiifolia to characterize 
the pathogen load of plants inhabiting inva- 
sive and native ranges. A lower prevalence of 
disease-inducing plant pathogens in the inva- 
sive range suggests that escape from naturally 
occurring pathogens contributed to the invasion 
of this species in Europe (28). 

Herbarium specimens retain plant roots, a 
habitat rich in microorganisms, and in certain 
instances contain remnants of the original soil 
(Fig. 2A). Characterizing root microbiomes could 
potentially unveil the dynamics of microbial 
communities influenced by land management 
or agricultural practices, such as the applica- 
tion of agrochemicals. 


Beyond herbarium genomics: Integrating 
genomics with other data 

Combining herbarium genomics with other 
molecular data 


Herbaria have also proven valuable to scientists 
from other plant biology disciplines and for re- 
search on global environmental change (22, 23). 
These alternative applications encompass the 
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use of multiproxy molecular data. The biggest 
challenge in using molecules other than DNA 
from historic materials lies both in the limited 
understanding of their degradation and in the 
lack of authentication procedures. Although sim- 
ple elements and their stable isotopes are not 
expected to degrade within herbarium timescales, 
care must be taken to avoid contamination. Ecol- 
ogists as well as soil and plant scientists assess 
the state of ecosystems using nitrogen concen- 
trations and isotope ratios, which reflect the 
cycling of key chemical elements. Herbarium 
specimens of multiple species allow monitor- 
ing of changes in nitrogen availability on a 
regional scale (Fig. 2B). Despite increased 
anthropogenic nitrogen deposition, foliar nitro- 
gen concentrations have decreased (57). Combin- 
ing analysis of nitrogen and carbon isotopes 
with phenology traits reveals heterogeneous 
physiological responses of A. thaliana that 
depend on growth habit, population structure, 
and changes in the environment (52). Combin- 
ing such studies with genomic characteriza- 
tion might reveal the extent to which responses 
to anthropogenic pressures are genetic or plas- 
tic. This possibility is particularly relevant for 
crops, whose genetic makeup changed contem- 
poraneously with the onset of fertilization 
regimes during the Green Revolution. 
Complex chemical molecules produced by 
plants can modulate biotic interactions such 
as plant defense, plant-plant competition, and 
pollinator attraction. Thus, characterizing such 
molecules through time can reveal the evolu- 
tion of multiple biotic interactions. However, 
isolating and quantifying complex chemical 
molecules is more challenging than isolating 
DNA and requires fine-tuning of multiple spe- 
cialized approaches. It is important to under- 
stand the kinetics of each quantified molecule, 
including its half-life and degradation prod- 
ucts. A study comparing terpenoids, impor- 
tant metabolites for growth and defense, in 
different sages (Salvia sp.) noted a significant 
effect of species but not of herbarium collec- 
tion date on the terpenoid composition, al- 
though concentrations decrease over time. 
Therefore, herbarium specimens are suitable 
for qualitative comparison of terpenoids, but 
care needs to be taken for their quantitative 
analysis (53). Specimens of some plant species 
were annotated with their historical chemical 
composition at the time of collection, allowing 
direct comparison with modern measurements. 
For example, quinine alkaloid concentrations 
measured 150 years ago in fever tree (Cinchona 
sp.) barks are broadly congruent with recent 
measurements, suggesting that this group of 
compounds is stable (54). Degradation dynamics 
have been assessed for only a handful of bio- 
molecules and should be evaluated individually 
for economically important plant products (me- 
dicinal, plant defense). Direct measurement 
coupled with genome sequencing should fa- 
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cilitate discovery of underlying genetic path- 
ways (Fig. 2B). 

Proteomes have been successfully isolated 
from ancient animal samples. Proteins de- 
grade more slowly than DNA, and strict pro- 
tocols for authenticating their historical or 
ancient nature have already been developed 
(55). However, these approaches have not yet 
been applied to herbarium specimens. Although 
RNA is much less stable than DNA, and authen- 
tication procedures are not currently available, 
its sequencing can be authenticated indirectly 
through damage pattern signatures in com- 
plementary DNA, providing, for example, in- 
sights into plant-microbe interactions at the 
time of sampling. 


Leveraging phenotypic and genomic data from 
herbarium specimens 


Examining changes in the timing of pheno- 
logical milestones such as leaf-out or flower- 
ing allows the effects of climate change to be 
studied. Herbarium specimens are valuable 
and reliable records for phenological research 
(Fig. 2B). Automation of phenological scoring 
through machine learning approaches will 
expand the scope of herbarium-based pheno- 
logical studies, enabling assessment of more 
phenotypes (56). The collection date of flower- 
ing herbarium specimens provides a reliable 
estimation of past flowering time (57). For 
example, long-term phenology trends from 
herbarium specimens indicate that European 
forest wildflowers shifted their phenology in 
response to climate change (58). However, 
any genetic basis is yet to be determined. 
Integrating herbarium genomics with pheno- 
logical research will reveal whether variation 
in phenotypic traits has a genetic basis or is 
solely attributable to phenotypic plasticity. 
Another important phenotype in the context 
of global environmental change is stomatal 
density, a measure of the pores that plants use 
for gas exchange (Fig. 2B). Stomatal density is 
expected to decrease as atmospheric CO, 
levels rise with industrialization, as has been 
revealed by herbarium specimens (59). A poly- 
genic score-like metric that was devised by 
integrating herbarium and present-day genomes 
with known effects of gene knockout mutations 
on stomatal development preliminarily showed 
a positive correlation with stomatal density 
in A. thaliana (60). This metric was validated 
using present-day genomes and, when applied 
to herbarium genomes, reproduces previously 
observed patterns of stomatal density decrease 
over time. This correlation was achieved with- 
out directly measuring stomatal density in her- 
barium specimens, indicating a potential genetic 
basis underlying the observed patterns (60). 
Characterization of interactions between 
plants and insect herbivores is enabled by quan- 
tifying lesions on herbarium specimens (Fig. 1). 
Analysis of herbarium specimens spanning 
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more than a century indicates that damage to 
plants by insect herbivores will likely continue 
to increase in the northeastern US as global 
temperatures rise (67). Herbarium genomics 
might enhance these studies by identifying 
herbivores through insect DNA left on plant 
tissue as well as plant genetic loci responsible 
for susceptibility to insect herbivory. 


Outlook 


Herbarium genomic approaches reveal the pro- 
found influence of human activities such as 
agriculture on plant evolution. However, there 
is still much to explore, particularly regarding 
anthropogenic effects on plant interactions 
with the environment. Biotic interactions can 
be investigated through metagenomics of her- 
barium specimens, whereas isotope ratios, 
stomata numbers, and phenology provide in- 
sights into abiotic interactions. Although the 
influence of climate change on population diver- 
sity and structure is routinely established, its 
evolutionary consequences and adaptive re- 
sponses are rarely investigated (62). Integra- 
tion of present-day and herbarium genomics 
continues to be essential for studying rapid 
adaptation in natural populations. 

With time, new ways to use herbarium col- 
lections emerge, and their potential grows with 
the introduction of new analytical techniques. 
For example, recently, herbarium specimens 
have gained relevance for plant “de-extinction,” 
the resurrection of plants considered extinct in 
the wild using preserved seeds or tissues from 
herbarium collections (63). Herbarium genom- 
ics holds potential for informing and validat- 
ing such resurrection efforts. 

Finally, herbarium collections temporally 
overlap colonial history, reflecting the acquisi- 
tion and preservation of botanical treasures 
from continents explored by Europeans since the 
15th century (64). Most specimens were obtained 
without proper permits or consent from Indige- 
nous communities. Today, legal and ethical 
obligations regarding herbarium materials are 
guided by the Convention on Biological Diver- 
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sity (CBD) and the benefit-sharing principles 
of the Nagoya Protocol. Although the CBD ap- 
plies primarily to recent collections, ethical 
considerations extend to older materials, par- 
ticularly plants cultivated on Indigenous lands, 
including crops, medicinal plants, and sacred 
species. 
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Deep-time paleogenomics and the limits 


of DNA survival 
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Although most ancient DNA studies have focused on the last 50,000 years, paleogenomic approaches 
can now reach into the early Pleistocene, an epoch of repeated environmental changes that shaped 
present-day biodiversity. Emerging deep-time genomic transects, including from DNA preserved in 
sediments, will enable inference of adaptive evolution, discovery of unrecognized species, and 
exploration of how glaciations, volcanism, and paleomagnetic reversals shaped demography and 
community composition. In this Review, we explore the state-of-the-art in paleogenomics and discuss 
key challenges, including technical limitations, evolutionary divergence and associated biases, and the 
need for more precise dating of remains and sediments. We conclude that with improvements in 
laboratory and computational methods, the emerging field of deep-time paleogenomics will expand 
the range of questions addressable using ancient DNA. 


he Pleistocene epoch [approximately 
2.6 million years ago (Ma) to 10 thou- 
sand years ago (ka)] was a time of con- 
siderable environmental upheaval that 
shaped the present worldwide distri- 
bution of biodiversity. Environmental changes 
during the Pleistocene included cyclical fluc- 
tuations in global temperatures and precip- 


@ Non-human animal paleogenomes 


Early Pleistocene 


\ 


Kap Kobenhavn sediments 
(oldest ancient DNA) 


Krestovka 
mammoth 


itation patterns, advances and recessions of 
high-latitude ice sheets, and substantial changes 
in sea level, together with large-scale volca- 
nism, paleomagnetic reversals, and the global 
spread of humans (1). These events altered 
habitats around the world, driving changes in 
resource availability and ecological commu- 
nity composition. 
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Fig. 1. The temporal distribution of ancient DNA studies to date highlights 
gaps and opportunities for deep-time paleogenomics and sedimentary 
ancient DNA. Most ancient DNA studies fall within the last 50 ka and the most 
recent glacial cycle. The climate curve is based on benthic 8!°-oxygen 
measurements [per mil; LRO4 stack from (42)]. Sedimentary ancient DNA data 
are from the AncientMetagenomeDir (v23.06.0) (57) and (58), with meta- 
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The rich fossil record of the Pleistocene Che 
‘ ‘ upo 
been instrumental for testing hypotheses al. —- 
correlation between these environmental changes 
and biodiversity dynamics, especially at high 
latitudes where the cold climate favors fossil 
preservation. This is particularly true for the 
Late Pleistocene (126 to 11.7 ka), thanks to fine- 
scale inferences enabled by ancient DNA pre- 
served in fossils dating to this period. Such 
inferences have allowed insights into popula- 
tion turnover (2-4) and interspecies gene flow 
(5)—processes that are invisible to traditional 
paleontological techniques—and shown that 
demographic trends in large mammals closely 
track available habitat (6). 

Technical advances in DNA recovery have 
extended the ability to make these inferences 
deeper into the Pleistocene. DNA from bones 
and teeth that are several hundreds of thou- 
sands of years old (7-9) and beyond one million 
years old (10) has now been recovered and 
analyzed (Fig. 1). Such deep-time paleogenomes, 
which we consider here to refer to genomes 
assembled from organisms that lived during 
or earlier than the Middle Pleistocene, i-e., 
>126 ka, are still rare because postmortem pro- 
cesses lead to successive degradation of DNA 
molecules into increasingly small fragments, 
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making DNA recovery more difficult with age. 
Early and Middle Pleistocene DNA has, how- 
ever, been recovered from remains and sedi- 
ments in high-latitude permafrost (J0-74) and 
lower latitude caves (15, 16), suggesting that 
deep-time genomics is feasible in ideal pres- 
ervation environments. Here, we explore the 
current state of the art in deep-time paleoge- 
nomics research, the key obstacles preventing 
wider adoption, and scientific questions that 
deep-time paleogenomics can address. 


DNA persistence into deep time 


DNA does not survive indefinitely but it does 
survive for considerably longer than predicted 
by the earliest models. In 1993, Lindahl esti- 
mated that hydrolytic depurination would 
lead to complete degradation of DNA mole- 
cules within several tens of thousands of years 
(17). This limit has since been exceeded, and 
DNA is regularly recovered from remains and 
sediments that date to within the last 100 ka. 
As of September 2023, the oldest reconstructed 
paleogenome is from a permafrost-preserved 
mammoth dating to between 1 and 2 Ma (10) 
and the oldest isolated DNA is from ~2-Ma sedi- 
ment from northern Greenland (ZZ). However, 
the maximum age of recoverable and useful 
DNA molecules—those that are long enough 
to retain information—remains uncertain. 

DNA begins to degrade immediately follow- 
ing organismal death, initially through micro- 
bial and endogenous nuclease activity (Fig. 2). 
In nuclear DNA, strands are cleaved in labile 
regions of histone-DNA complexes, resulting 
in a ~10-base periodicity in the distribution of 
the lengths of recovered molecules (18). The 
primary chemical mechanism of DNA frag- 
mentation is hydrolytic depurination. This 
process removes adenine or guanine bases, 
creating abasic sites that can be cleaved by 8B 
elimination (79) (Fig. 2C), and leading to purine 
overrepresentation adjacent to strand breaks 
(20) (Fig. 2E) and interior gaps (27). Hydrolytic 
deamination, another common form of chemi- 
cal damage, converts cytosine to uracil and is 
observed as thymine in sequencing data, or 
“C-to-T transitions” (Fig. 2C). Deamination 
occurs primarily near strand ends and in single- 
stranded DNA (17, 21, 22) (Fig. 2E). DNA cross- 
linking (19, 22) and oxidative damage (20, 23) 
also occur but are observed less frequently than 
depurination and deamination. These typical 
damage patterns can be used to bioinformati- 
cally corroborate the authenticity of recovered 
ancient sequences and, to reduce their impact 
on sequence accuracy, can be identified and re- 
moved from ancient DNA data sets using stan- 
dard bioinformatic approaches. 

Recovery of increasingly old and damaged 
DNA is possible in part due to technical advances 
in the laboratory. Ancient DNA isolation meth- 
ods are optimized to recover both short DNA 
molecules and molecules containing nicks and 
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are too short to be useful. (A) The integrity of megabase length DNA is maintained by a cell's enzymatic 
repair machinery and, in eukaryotic genomes, packaged in histone-DNA complexes. (B) Following death, 


repair stops and DNA damage begins to accumulate. 
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gaps. Extracted molecules are prepared for se- 
quencing by ligating platform-specific adapt- 
ers to either double-stranded or single-stranded 
DNA. Single-stranded approaches to genomic 
library preparation (24, 25) convert natively 
single-stranded DNA as well as double-stranded 
DNA and more effectively convert molecules 
containing nicks and gaps compared with 
double-stranded approaches. DNA extracts are 


become too short to be identifiable. (E) A summary of base and mismatch frequencies 


using a single-stranded DNA library protocol [library 
of adenine and guanine bases adjacent to strand 

and observed throughout damaged reads. Whereas 
ibraries, single-stranded libraries show a C-to-T signal 
cules. 


and endonuclease VIII to reduce deamination 
damage by removing uracil bases (26). Although 
this approach reduces damage-induced errors 
in the resulting sequencing data, it also cuts 
the DNA backbone at abasic sites and shortens 
the recovered molecules by 5 to 10 nucleotides 
(26). Deep-time DNA molecules are already 
short—often <35 bases (15)—and consequently 
this may reduce the proportion of useful endo- 
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The short nature of deep-time DNA molecules 
makes them prone to spurious alignment and 
reference bias (27), complicating genome as- 
sembly and analysis. For example, ancient DNA 
data sets comprise both endogenous DNA from 
the target organism(s) and introduced exoge- 
nous DNA. These categories of molecules can 
be separated by identifying each read through 
taxonomic assignment, which can be problem- 
atic if the ancient organism has no close living 
relative to act as a genomic reference. Lack 
of a close reference, reference bias, and errors 
introduced by damage will also impede var- 
iant and consensus calling. Bioinformatic ap- 
proaches mitigate these challenges by directly 
modeling DNA damage and/or bias as part 
of genotyping (28) or considering only sub- 
stitutions that are not affected by cytosine 
deamination. Reference genomes can also be 
modified to create artificially closer references, 
such as a “Neanderthalized” version of the hu- 
man reference genome for reference-guided 
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mapping of Neanderthals reads (29). Genotype 
likelihoods rather than strictly called genotypes 
can also be used during downstream analysis, 
although imputation-based analytical methods 
may be inappropriate for deep-time data sets if 
ancient genomic diversity is not represented 
in existing reference panels. 


Research opportunities arising from 
deep-time DNA 
Speciation and evolution 


Speciation is not always a simple process of 
cladogenesis followed by reproductive isolation. 
Instead, modern and paleogenomic data have 
shown that interspecific hybridization is sur- 
prisingly common and perhaps driven in part 
by repeated habitat redistribution associated 
with glacial cycles (5, 9, 10). For example, brown 
bears and polar bears hybridized during pre- 
vious glacial and interglacial periods (30, 31) 
as well as in the modern era. Recently, polar 
bear and cave bear paleogenomes dating to up 
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Fig. 3. Deep-time paleogenomes provided new understanding of the evolutionary history of mammoths. 
Paleontological hypotheses assumed that the M. columbi lineage evolved after early divergence from 

M. primigenius (A), however, isolation of a deep-time paleogenome from the Krestovka mammoth (blue 
circle) revealed that M. columbi emerged more recently following admixture between the Krestovka and 


M. primigenius lineages (B). Adapted from (10). 


Box 1. Deep-time mammoth DNA and the inference of a lineage’s entire evolutionary story. 


The power of a deep-time genomic approach was showcased in a study by van der Valk and col- 
leagues (10) in which genome-wide data was collected from three Siberian mammoths dated to ap- 
proximately 700 ka to 1.2 Ma, which made it possible to examine mammoth evolution from multiple 


genomic angles: 


A new lineage: The oldest of the mammoth specimens belonged to a previously unknown and 
divergent evolutionary lineage, Krestovka. This implied that two distinct lineages of mammoth, Krestovka 
and the ancestors of woolly mammoths, lived in Siberia during the later stages of the Early Pleistocene. The 
analysis also revealed that mammoths belonging to the Krestovka lineage were the first mammoths to 


colonize North America 1.5 to 1.2 Ma (61). 


Hybridization: Multiple lines of evidence suggested that the Columbian mammoth originated as a 
result of hybridization between the Krestovka lineage and early woolly mammoths. This hybridization 
took place as woolly mammoths expanded into North America during the Middle Pleistocene, after the 
Krestovka lineage was already established on the continent (Fig. 3). Columbian mammoths derive 
approximately 50% ancestry from each of these two lineages. 

Adaptive evolution: The deep-time nature of the mammoth data set allowed van der Valk et al. to 
estimate the rate of adaptive evolution in mammoths. They concluded that the evolutionary origin of 
the woolly mammoth lineage did not coincide with an increased rate of protein-coding changes and 
therefore higher rates of positive selection across the genome (10). Subsequent analyses identified a 
suite of genes that underwent protein-coding changes during the last 700 ka and were thus specific to 


woolly mammoths (37). 
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to 360 ka revealed that all living brown bears 
derive a portion of their ancestry from admixture 
with these other bear lineages—evolutionary 
events that were invisible without these paleo- 
genomes (9, 32). Similarly, a mammoth paleo- 
genome dating to the Early Pleistocene revealed 
that Columbian mammoths (Vammuthus 
columbi) originated after hybridization between 
two distinct ancient mammoth lineages (10) 
(Fig. 3). Taxonomically diverse deep-time paleo- 
genomes could clarify the timing, rate, and 
extent of genomic introgression episodes and 
their role in evolution. Paleogenomic data 
from species that went extinct during the Early 
and Middle Pleistocene, such as short-faced 
hyenas, European jaguars, and the enigmatic 
Xenocyon canids, could shed light on whether 
these taxa contributed to the genetic make-up 
of living carnivores. Deep-time paleogenomes 
could also identify unknown “ghost” lineages 
that contributed to species’ ancestries, as ex- 
emplified in the paleogenomic characteriza- 
tion of the Krestovka mammoth (0) (Box 1 and 
Fig. 3). Deep-time DNA can reveal genomic 
snapshots of an individual species’ entire evo- 
lutionary story (Box 1). As many temperate and 
cold-adapted birds and mammals trace their 
origin to the Early and Middle Pleistocene 
(33, 34), paleogenomes from these species could 
correlate evolutionary changes to specific envi- 
ronmental perturbations, such as transitions 
between climate regimes or community reshuffl- 
ing. The process of speciation can be investi- 
gated as it happens, through exploration of 
founder event bottlenecks and testing whether 
speciation occurred through strict allopatry 
or gradually with post-divergence gene flow. 
As deep-time paleogenomes tend to occupy 
basal phylogenetic positions within their clades, 
they can also provide important calibrations 
for estimating rates of molecular evolution. For 
example, paleogenomic data from a Middle 
Pleistocene hominin from Sima de los Huesos 
in present-day Spain confirmed hypotheses 
from Late Pleistocene genomes that Neander- 
thals and Denisovans diverged during the early 
Middle Pleistocene (35), whereas the inclusion 
of a ~’700-ka horse paleogenome in the equid 
phylogeny pushed the estimated time for the 
origin of living equids to more than twice that 
previously hypothesized (8). 

Paleogenomes from the Early and Middle 
Pleistocene can also be used to test hypotheses 
about relationships among species, including 
how derived forms are related to earlier forms. 
An outstanding question in paleontology is 
whether fossil morphospecies are true species, 
synchronous ecomorphs, or chronospecies that 
were direct ancestors of succeeding species. A 
paleogenomic study of ancient North Ameri- 
can bison dating from ~130 to 110 ka, for ex- 
ample, showed that two samples exhibiting 
extreme size dimorphism and representing 
supposedly distinct species—the longhorn bison 
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and the steppe bison—actually belong to the 
same lineage that dispersed into North America 
only a few tens of thousands of years earlier (36). 
Conversely, deep-time paleogenomics can also 
give context to species for which we have only 
limited remains, such as Denisovans (35). 

Finally, paleogenomes across deep-time scales 
will make it possible to explore aspects of 
adaptive evolution. At the most basic level, 
deep-time genomes can help identify when 
adaptive mutations arose. For example, com- 
parative analysis of mammoth paleogenomes 
ranging from a few thousand to more than a 
million years old identified genes associated 
with hair and skin development, fat storage 
and metabolism, immune system function, 
and body size that evolved in the woolly mam- 
moth lineage within the last 700 ka (37). Paleo- 
genomes will also allow exploration of how 
the rate of protein coding changes varies over 
time, such as in conjunction with past changes 
in climate, as well as to assess when genomic 
deletions arose and the rate of positive and 
purifying selection in introgressed genomic 
regions. 


The impact of glacial cycles on biodiversity 


Nearly all ancient DNA studies to date have 
for practical reasons focused on Late Pleisto- 
cene or more recent materials (Fig. 1). Thus, 
our current understanding of evolutionary pro- 
cesses during the Early (2.6 Ma to 780 ka) and 
Middle (780 to 126 ka) Pleistocene sub-epochs 
relies mostly on more traditional approaches, 
including morphometrics, stable isotope analy- 
sis, and pollen records. Increasing access to 
genomic data from fossils and sediments dat- 
ing to these earlier sub-epochs will enable 
more explicit tests of hypotheses about how 
glacial cycles affect evolution and biodiversity. 

A special attribute of the earlier Pleistocene, 
for example, is the change in periodicity of 
glaciations from ~40 ka cycles to ~100 ka cycles 
that occurred 1.2 to 0.7 Ma (38) (Fig. 1). This 
change resulted in isolation of temperate spe- 
cies in glacial refugia for longer periods, pro- 
viding more time for local adaptation and 
increasing the rate of population divergence. 
Biological communities may also have been re- 
shuffled, as the longer and higher amplitude 
glaciations allowed sufficient ice sheet accumu- 
lation for the Bering Land Bridge to form, 
making land dispersal between Eurasia and 
North America possible. 

Since the change in glacial periodicity, the 
dominant pattern has been cycles of long gla- 
ciations separated by short, warm interglacials. 
This pattern is believed to have driven the demog- 
raphy and range dynamics of many species (39). 
Long interglacials, for example, have been cor- 
related with bottlenecks in cold-adapted taxa 
(40) and expansion and speciation in warm- 
adapted taxa (47). Of particular interest is the 
unusually long interglacial that occurred 420 
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to 370 ka (Marine Isotope Stage 11) (42). 
Paleogenomes from individuals that lived dur- 
ing this long bottleneck and earlier could be 
used to test these hypotheses and reveal evo- 
lutionary changes that were overwritten by 
subsequent genetic bottlenecks. 


Inference of ancient ecosystems 


We have described insights that could be de- 
rived from DNA extracted from remains of 
individuals that lived during the Middle Plei- 
stocene and earlier. However, the advances 
that enable deep-time paleogenomics also make 
it possible to reconstruct entire deep-time eco- 
logical communities. To date, only five studies 
have attempted to use sedimentary ancient DNA 
to reconstruct plant and/or animal commu- 
nities dating to the Middle Pleistocene or older: 
Kjeer et al. (11) reconstructed components of an 
Early Pleistocene interglacial ecosystem from 
sediment extracted from the present day polar 
desert in northern Greenland; Armbrecht e¢ ai. 
(43) reconstructed an Early to Middle Pleis- 
tocene marine ecosystem from Iceberg Alley 
in the Southern Ocean; Courtin et al. (12) re- 
constructed a Middle Pleistocene interglacial 
ecosystem from a permafrost megaslump in East- 
ern Siberia; and Willerslev et al. reconstructed 
Middle Pleistocene plant communities from sedi- 
ments collected below the Greenland ice sheet 
(4) and from coastal Siberian permafrost (73). 
Among these, Kjzer et al. and Armbrecht et al. 
enriched libraries for sequences of interest 
through hybridization to synthesized baits de- 
signed to target Arctic or Antarctic taxa. In 
contrast to the PCR-based metabarcoding 
approach used by the other three studies, 
hybridization-based targeted enrichment can 
capture molecules of any length and are there- 
fore powerful even when preserved molecules 
are short. Although the hybridization-based ap- 
proach is currently limited to capturing sequenc- 
es that are genetically similar to other known 
taxa, methodological improvements in hybrid- 
ization capture is a ripe area of research that will 
expand access to deep-time sedimentary DNA. 

Deep-time sedimentary DNA research will 
enable better understanding of the effect of 
glacial-interglacial transitions on community 
composition. Reconstructions of communities 
spanning the transition into the present Holo- 
cene, for example, have revealed rapid biological 
turnover that closely tracked abiotic changes 
(44, 45). Comparison with older transitions 
will test whether patterns are predictable or 
idiosyncratic, whether some species or commu- 
nities are more resilient to environmental up- 
heaval than others, and whether some transitions 
or events leave lasting signatures on commu- 
nity biodiversity. 

Reconstructions of communities that thrived 
in past warm interglacials may provide insight 
into the potential composition of communities 
in a future, warmer world (17), and improve our 


understanding of how ecosystem-level inter- 
actions among species evolve and are main- 
tained. They also enrich our understanding 
of these extinct ecosystems beyond what is 
knowable from the fossil record. Deep-time 
sedimentary DNA from northern Greenland, 
for example, revealed a mastodon or mastodon- 
like animal that was part of the Early Pleisto- 
cene community (17), although no fossil remains 
from such an animal have been discovered. 
Deep-time sedimentary DNA can also reveal 
past connectivity among populations, as indi- 
cated by a recent study of Late Pleistocene 
sedimentary DNA from a cave in Mexico that 
linked an extinct population of black bears 
to living populations in eastern North America 
(46). As technologies improve—particularly 
those that allow increasingly sensitive targeted 
enrichment—we envisage deep-time sedi- 
mentary DNA as a powerful tool to explore 
the ecological and evolutionary consequences 
of environmental change on community-level 
biodiversity. 


Future research to enable recovery 
of deep-time DNA 


It has been shown that DNA can survive in 
ideal preservation conditions into at least the 
Early Pleistocene. The next phase of deep-time 
DNA research is to expand the taxonomic, 
geographic, and temporal range of recovered 
and authenticated deep-time DNA. This chal- 
lenge presents new research opportunities in 
the field, at the bench, and bioinformatically. 
Deep-time genomics is today mostly con- 
ducted on substrates with optimal DNA pres- 
ervation such as those derived from permafrost 
or caves. However, more efficient approaches 
to recover ancient DNA molecules will con- 
tinue to expand the range of samples and 
substrates suitable for analysis. Today, meth- 
ods for DNA extraction and library conversion 
do not recover all potentially preserved DNA 
molecules. For example, Kjaer et al. (17) found 
that DNA adsorbed preferentially to clay 
mineral surfaces compared with nonclay 
surfaces, particularly to the clay mineral 
smectite, which can bind 200 times more DNA 
than quartz and is a common mineral in terres- 
trial samples. Their best performing extraction 
protocol recovered 40% of DNA bound to 
quartz and only 5% of DNA bound to smectite, 
suggesting that most DNA was inaccessible. 
Although anecdotal, this observation points 
to several opportunities for improving deep- 
time DNA research, including using minera- 
logical characterization to identify the most 
promising sites for deep-time sedimentary 
DNA recovery and refining experimental ap- 
proaches to recover DNA bound to all mineral 
surfaces. In the absence of improved methods 
to release bound DNA, microscopic evaluation 
of sedimentary samples will improve the ef- 
ficiency of DNA recovery. Massilani et al. 
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(47), for example, showed that DNA preserved 
in cave sediment is concentrated in microscale 
particles, especially fragments of bone and 
feces preserved within the substrate. 

Library conversion protocols could also be 
made more efficient. Optimized library con- 
version protocols use enzymatic ligation and 
polymerization, but ancient DNA extracts con- 
tain inhibitors as well as molecules with un- 
characterized DNA damage. Although we can 
convert as little as 100 picograms of DNA into 
libraries using the Santa Cruz method (25), 
library preparation has been shown to typ- 
ically convert only ~10 to 50% of extracted 
DNA (21), suggesting that most recovered 
molecules are lost at this experimental step. 
Improvements in library preparation may 
include engineering more robust enzymes to 
combat inhibitors or developing protocols that 
incorporate enzymatic repair during library 
conversion. Additionally, reducing reliance on 
ligase and polymerase steps through alter- 
native enzymatic strategies, bio-orthogonal 
chemistry, or native DNA sequencing may 
offer new approaches to convert currently un- 
sequenceable DNA molecules. 

Many species that are obvious targets for 
deep-time DNA research are extinct and some 
such as Xenocyon canids and basal members 
of the elephant and horse families have no 
evolutionarily close living relative for which 
an ideal reference genome can be produced. 
This presents challenges to ancient DNA au- 
thentication and identification as well as to 
reference-guided genome assembly. Although 
the average fragment length of deep-time DNA 
sequences is short, it may be possible to gen- 
erate de novo assemblies from ancient extracts 
by capitalizing on methods that use chromo- 
some conformation capture to retain proximity 
information useful to link short reads within a 
chromosome (48). Approaches that sequence 
DNA in situ (49) are also promising but are 
currently in early stages of development. Im- 
provements in bioinformatic processing will 
also benefit eukaryotic paleogenomic recon- 
struction and variant calling. Recently, mi- 
crobial genomes were assembled from DNA 
recovered from relatively recent paleofecal 
samples (50) and from archaeological dental 
calculus dating to as old as 100 ka (57), suggest- 
ing a bioinformatic path toward de novo assem- 
bly of some small paleogenomes. Although 
this approach is not likely to apply to com- 
plex eukaryotic genomes, other bioinformatic 
approaches can improve the accuracy of these 
assemblies from short read data. Replacing 
linear single-species reference genomes with 
multispecies variation graphs that incorpo- 
rate variants from several genomes (52), for 
example, can increase the number of reads 
that map to a reference genome. This approach 
has the additional benefit of allowing variation 
among indel lengths as well as among nucleo- 
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tides. Iterative assembly approaches, such as 
the mapping-iterative-assembler used to gener- 
ate the first Neanderthal mitochondrial genome 
(63) may improve mapping to more complex 
genomes. Finally, as reference-based taxono- 
mic assignment is always limited to sequences 
deposited in public databases, the ongoing 
population of these databases will continue to 
improve robust identification of DNA recov- 
ered from Early and Middle Pleistocene re- 
mains and sediments. 

One considerable challenge for studies of 
deep-time DNA is knowing the age of samples 
so that they can be placed into broader evo- 
lutionary and geological contexts. As most 
ancient DNA to date is from organisms that 
lived within the last several tens of thousands 
of years, it is usually possible to estimate their 
age directly using radiocarbon dating. How- 
ever, the short radioactive half-life of carbon- 
14 means that age estimates are often unreliable 
if organisms lived more than ~50 ka. Trapped 
charge dating methods, such as electron spin 
resonance (ESR) for tooth enamel or lumines- 
cence approaches for minerals such as quartz 
and feldspar, can provide age estimates for 
samples dating throughout the Pleistocene 
but require that sediments remain undisturbed 
since burial [for a review see (54)]. When pro- 
teins are preserved, the extent of amino acid 
racemization, hydrolysis, and decay can also 
estimate time since death, although amino acid 
“clocks” are known to vary among species and 
localities (54). 

In some cases, paleoenvironmental, geologi- 
cal, and geophysical markers can provide clues 
about a sample's age. A fossil might be found 
in the Arctic with other paleoecological proxies 
that suggest a warm and wet environment, 
for example, indicating that the animal lived 
during a previous interglacial or in sediments 
with reversed polarity suggesting that it lived 
prior to the last paleomagnetic reversal some 
780 ka. In some environments, tephra beds— 
layers of fine, settled volcanic ash—can be 
dated by methods such as glass fission-track or 
argon-argon dating. Tephra beds, which are 
detectable even when present in only micro- 
scopic amounts (55), have been particularly 
important in dating sediment cores but can 
also provide contextual clues about the age 
of samples found in situ at sites where tephra 
is present. As volcanic eruptions were common 
throughout the Pleistocene, improved tephro- 
chronologies spanning the Early and Middle 
Pleistocene will help place deep-time DNA 
into a chronological context. 

Other approaches to dating deep-time ge- 
nomes might rely on the predictable nature of 
evolutionary change in organisms. Molecular 
clock methods infer the age of paleogenomes 
by estimating the amount of “missing” evolu- 
tion along a phylogenetic branch leading to the 
paleogenome, often called “branch shortening” 


(56). Because the accumulation of mutations is 
approximately constant over time, the differ- 
ences between these branch lengths should 
correspond to the number of generations that 
separate the represented paleogenome from 
extant or more recent individuals. To translate 
missing generations into calendar time, how- 
ever, the branch shortening approach requires 
either an independent fossil calibration or an 
estimate of generation length. For many line- 
ages that lived during the Early and Middle 
Pleistocene, dated ancestral fossils are few and, 
with no close living relatives, estimates of 
generation time would be imprecise. Variation 
among evolutionary rates between distantly 
related lineages may also reduce the power of 
a comparative molecular dating approach. None- 
theless, development of approaches that use 
genomic information to estimate the age of 
paleogenomes and their evolutionary relation- 
ships to other species is a rich area for future 
research. 


Conclusions 


The next decade will bring continued techni- 
cal advances that will expand the taxonomic and 
geographic range of deep-time paleogenomes 
and deep-time ancient sedimentary DNA data 
sets. These will include new insights into what 
substrates are likely to preserve deep-time an- 
cient DNA, as well as refined approaches to 
release DNA bound to biological or mineralo- 
gical matrices. These newly assembled deep-time 
paleogenomes will be placed into chronological 
context through developments in geochronology 
and paleoecology, together with more powerful 
computational approaches to estimate the age of 
samples using a molecular clock. The resulting 
deep-time data sets will enable reconstruction of 
evolutionary histories across repeated environ- 
mental perturbations, refining understanding of 
adaptive evolution, community organization, 
and ecosystem resilience. Moreover, as the past 
by its nature is different from anything that 
exists today, access to deep-time DNA provides 
ample yet unpredictable opportunities for sci- 
entific discovery. 
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EDITORIAL 


Modern government and science advice 


overnments need to understand science. This 
is obvious when thinking about defense and 
security, health, or the challenges of climate 
change and biodiversity loss, but it is true for 
all areas of government activity. Science has 
something to offer in developing policy for 
town planning, education, transportation, food, 
environmental management, the administration of jus- 
tice, communication systems, the use of artificial intelli- 
gence to improve public services, and much more. Seven 
out of the 10 largest companies in the world are based 
on science and technology, and there is a correlation be- 
tween spending on R&D and productivity at a national 
level. Yet in most democracies, scientists make up a tiny 
fraction of politicians, and recently 
there have been questions about the 
overextension of scientific authority 
(so-called “scientism”) or the inap- 
propriate empowerment of scientists. 
What, then, is the role of a science ad- 
viser to modern government? 

In 1974, Solly Zuckerman, the 
United Kingdom’s first official scien- 
tific adviser, said that decision-mak- 
ing rested with government ministers, 
and that “If scientists want more than 
this then they’d better become politi- 
cians.” This notion was also captured 
by Winston Churchill, who said “scientists should be on 
tap not on top.” These ideas may be true in a democ- 
racy, but it doesn’t mean that science won’t be ignored 
or misused by politicians, nor does it mean that what 
scientists say will be understood. 

Effective interaction with government is challenging 
for science advisers in the presence of a rising tide of 
public anti-science sentiment. To overcome mistrust of 
science, science advisers should consider four key is- 
sues when engaging with governments. It is essential 
to ensure that the evidence base for recommendations 
is adequate, and if not, figure out what, if anything, 
can be done to supplement it in a timely manner. It 
is crucial that advice given to government is under- 
stood, including the uncertainties. A major obstacle is 
that nonscientists may often view science as definitive 
and providing an immutable truth, rather than being a 
self-correcting process of discovery. Scientists have not 
done a good job in describing the process of science, 
and how consensus can change with time. This has led 
to the exploitation and subversion of evidence by those 
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“Science 
advice requires 


dialogue, not 
data dumping.” 


who stoke mistrust in science. Wearing masks and vac- 
cination during the pandemic are stark examples. 

Advice should be in a form that is relevant to policy- 
makers. For example, although it is interesting to know 
the precise rate of mutation in coronaviruses, this infor- 
mation is unlikely to help guide policy decisions unless 
placed into context. What are the implications that fol- 
low from that observation? What are the relevant social 
and behavioral issues? Science advice requires dialogue, 
not data dumping. Advice to government also must ex- 
plain how science can be used to monitor the effects of 
any policy choice. Unfortunately, rigorous evaluation of 
the effects of new policies, and the transparency of the 
evaluations, is underdeveloped in many countries. 

Effective science advice is not pos- 
sible unless there is a strong indepen- 
dent science base that government 
advisers can call on. And it is not just 
academics that are needed—often, the 
expertise lies in industry or in gov- 
ernment laboratories. Science advice 
should be published so that the public 
and the scientific community can pro- 
vide constructive critique. This is, ulti- 
mately, the very way in which science 
progresses. Publication also shows 
how and whether science affected the 
ultimate policy choice made. 

Science advice is unlikely to be effective if govern- 
ments lack scientists. This absence often means that 
there are not effective mechanisms to absorb recom- 
mendations and evaluate policy implications. The lack 
of scientifically trained individuals in government is 
striking. For example, in 2018, university graduates 
entering the “fast stream track” of the civil service in 
the United Kingdom were overwhelmingly made up 
of students from the arts and humanities; only about 
10% held a degree in science. Attracting more scien- 
tists into government will require new approaches to 
recruitment and training. 

Policies must balance conflicting interests while ad- 
dressing specific problems. Science advice cannot be 
followed slavishly, but if decision-makers ignore such 
advice, or treat it as a box-ticking exercise, then the 
systems that uphold societal well-being will be at risk. 
The most stark example of this is climate change—and 
we are witnessing the results of that crisis. 


-Patrick Vallance 
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PHYSICS 


Fusion experiment 
beats previous.energy record 


ightning has struck a second time for physicists 
using lasers to achieve nuclear fusion, in which 
two atomic nuclei combine into one while releas- 
ing enormous amounts of energy. On 30 July, the 
192 lasers of the stadium-size National Ignition 
Facility (NIF) at Lawrence Livermore National 
Laboratory instantaneously crushed a tiny capsule 
filled with heavy isotopes of hydrogen. In doing so, they 
prompted a fusion reaction that produced more energy 
than the laser beams deposited onto the target. The new 
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The National Ignition Facility’s 
reaction chamber (central 
blue sphere), surrounded by 
laser beam tubes and 
diagnostic instruments. 


results, presented by NIF scientists at a conference last 
week, mark the second time in 8 months that NIF has 
achieved “energy gain.” NIF’s December 2022 experi- 
ment released 3.15 megajoules (MJ) of fusion energy— 
the equivalent of about three sticks of dynamite, and 
54% more energy than went into the target. In the July 
test, a shot yielded 3.88 MJ, an 89% surplus. The results 
will help researchers learn which factors separate pow- 
erful blasts from duds—a possible step toward one day 
making fusion a reliable energy source. 


Flu shot to drop vanished strain 


INFECTIOUS DISEASE | An influenza virus 
variant that circulated widely before the 
COVID-19 pandemic has disappeared, 
leading an expert group that advises the 
World Health Organization (WHO) to 
recommend last week that it no longer be 
included in flu vaccines. The strain, known 
as Yamagata, has not been detected since 
March 2020, and some scientists suspect it 
went extinct because of social distancing 
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measures imposed during the pandemic. 
WHO’s advisory committee, which meets 
every February and September to deter- 
mine which viral strains should be in flu 
vaccines, noted that because making flu 
vaccines requires growing the virus, a 

lab accident could in theory reintroduce 
Yamagata. To reduce this risk, “every effort 
should be made to exclude it as soon as 
possible,” it concluded. National regulators 
ultimately decide on the composition of flu 
vaccines in their own countries. 


Lab-developed tests get scrutiny 


BIOMEDICINE | The U.S. Food and Drug 
Administration (FDA) wants to start 

to regulate a slew of medical tests that 
now lack stringent oversight. These 
“lab-developed tests” are made and used 
within a single lab and include everything 
from predictors for Alzheimer’s disease to 
genetic analyses of tumors, but they have 
faced concerns about their validity. Under 
the new rule, which FDA proposed last 
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week, the testmakers would be required 
to submit data on their tests’ accuracy 

as medical device-makers do now. In 

a statement, FDA noted concerns over 
misdiagnoses and patients being over- or 
undertreated for common conditions such 
as heart disease. Commercial labs and 
academic medical centers that develop 
the tests argue that additional regulation 
will delay scientific advances. FDA will 
accept public comments on the rule until 
4 December. 


Good reviews for eLife model 


PUBLISHING | An analysis shows authors 
sharing their work in the nonprofit journal 
eLife report largely favorable experi- 
ences with the new publishing model it 
introduced in January, under which it 
posts manuscripts and associated peer 
reviews but no decision to accept or reject 
them. The selective, open-access journal, 
which focuses on the life sciences, aimed 
to boost the speed and transparency of 
publication. Authors can opt for the old 
model, but more than 90% of manuscripts 
submitted from February to July went 
through the new process, says the report, 
commissioned by eLife from an outside 
firm and released last week. The experi- 
ment has been closely watched, with 
some critics predicting it would drive 
away quality submissions. According to 
the report, total submissions declined 

by one-quarter, to 3223, compared with 
the same span of months in 2022. (eLife 
said it nearly reached its target.) Reviews 
were published within 79 days, on aver- 
age, of the preprint’s posting, down from 
an average of 232 days for publication 

of accepted papers under the old model. 
Seventy-one percent of reviewing editors 
said the quality of manuscripts stayed 

the same or improved, while only 42% of 
senior editors agreed. 


Biobanking endangered species 


CONSERVATION | A new partnership 
announced this week will preserve the 
cells of one-quarter of all threatened 

and endangered mammal species in the 
United States to aid in current and future 
recovery efforts. To date, scientists have 
cryopreserved tissue for only 14% of the 
approximately 1700 U.S. species listed 

as threatened or endangered. So the 
nonprofit Revive & Restore is funding a 
project with several partners to develop 
a biobank pipeline. In a pilot, biologists 
from the U.S. Fish and Wildlife Service 
are collecting tissue samples from 

24 endangered mammal species, including 
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A Preble’s meadow jumping mouse captured ina 
population survey provided skin cells for biobanking. 


the Preble’s meadow jumping mouse, the 
Mexican wolf, and the Sonoran pronghorn. 
The San Diego Zoo Wildlife Alliance and 
the cloning company ViaGen Pets & Equine 
will sequence the genomes and create 

cell lines, which will be preserved in the 
U.S. Department of Agriculture’s national 
repository for cryopreservation. Such speci- 
mens could be used to clone endangered 
species and boost the genetic diversity of 
wild populations. 


Grizzly reintroduction planned 


ECOSYSTEMS | The U.S. National Park 
Service and Fish and Wildlife Service are 
proposing to reintroduce a top predator 
to the North Cascades mountain range 
in Washington state. A draft plan, 
released last week, envisions establishing 


a population of 25 grizzly bears within 

5 to 10 years, with the population then 
naturally increasing to about 200. Grizzlies 
live in nearby mountain ranges in British 
Columbia, Idaho, and Washington, but 
highways and railroads prevent these bears 
from dispersing into the North Cascades. 
According to the plan, bears would be cap- 
tured from these populations and relocated 
by helicopter. Ranchers often oppose the 
recovery of grizzlies and other predators, 
so the plan lays out options for reducing 
conflict with ranchers and hikers, such as 
allowing the bears to be relocated or killed 
in certain circumstances. The plan is open 
for public comment until 13 November. 


Trust shapes views on science 


PUBLIC OPINION | Just 51% of Americans 
who express low trust in government and 
other institutions are fully vaccinated for 
COVID-19 versus 89% with high institu- 
tional trust, a survey has found. Although 
Republican respondents expressed less of 
this trust than Democrats overall, indi- 
viduals with high trust were more likely 
to get vaccinated than those with low 
trust even within each party, according to 
results from the online survey, released 
last week by the right-leaning American 
Enterprise Institute. Overall, trust in 
scientists fell during the pandemic: Only 
69% of the 5055 respondents expressed 
confidence that scientists would act in the 
public’s best interests, down from 86% in 
January 2019. 


DEVELOPMENTAL BIOLOGY 


How hammerhead sharks get their hammers 


ith their broad, flattened snout tipped at each end with giant google eyes, 
hammerhead sharks are both charismatic and easy to spot. But unlike most 
fish, hammerheads give birth to live young, so researchers have not been able 
to learn much about the development of any of the 10 or so species. Now for 
the first time, scientists have captured step by step how this “hammer” forms 
in developing embryos. Researchers at the University of Florida collected, imaged, and 
stained embryos of various stages from a small hammerhead species, the bonnethead 
shark, which were being studied for other reasons. Even though the shark’s head forms 
early on, the hammer doesn’t begin to appear until about 2 months later, about halfway 
through gestation, the team reported last week Developmental Dynamics. The hammers 
of other sharks in this group likely follow the same developmental pathway, the research- 
ers say. Next up: figuring out the genes involved in creating this unusual snout. 


Abonnethead shark embryo roughly 3 months into its 4- to 5-month gestation. 
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A health worker examines a young participant in a clinical trial of the R21 malaria vaccine in Kenya. 


Second malaria vaccine gets WHO green light 


New shots could make malaria protection more plentiful, saving tens of thousands of lives 


By Gretchen Vogel 


he world will soon have another 

powerful weapon against malaria, a 

disease that kills half a million peo- 

ple each year, most of them young 

children in sub-Saharan Africa. Data 

from a trial involving 4800 children 
in four African countries suggest that a vac- 
cine developed at the University of Oxford, 
known as R21/Matrix-M, provides signifi- 
cant protection against the disease. The re- 
sults were posted as a preprint hosted by 
The Lancet last week, and on 2 October, the 
World Health Organization (WHO) recom- 
mended R21 to prevent malaria in children 
living where the disease is endemic. 

The decision makes R21 the second WHO- 
approved vaccine for use against malaria. 
The first, called RTS,S/ASO1 or Mosquirix 
and made by GSK, was recommended for 
use in 2021 and has been given to 1.8 mil- 
lion children in Ghana, Malawi, and Kenya. 
Doses should arrive in nine more countries 
in Africa by the end of the year. But the 
18 million doses of that vaccine expected 
to be available between now and 2025 “are 
only about 10% of what we need” to protect 
the estimated 40 million children who are 
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born every year in malaria-affected areas, 
says Matthew Laurens, a malaria vaccine 
expert at the University of Maryland School 
of Medicine. 

R21 could soon be available in much 
larger quantities. It “has the potential to 
save tens of thousands of lives each year,’ 
Hanna Nohynek, chair of WHO’s Strategic 
Advisory Group of Experts on Immuniza- 
tion, said in a press conference announcing 
WHO’s recommendation. 

Like RTS,S, R21 requires multiple initial 
doses and a later booster. Both vaccines in- 
duce immunity with a protein called CSP 
from Plasmodium falciparum, the para- 
site that causes the deadliest form of ma- 
laria. The new vaccine contains a different 
immune-boosting agent, or adjuvant, and 
its viruslike particles are more thickly stud- 
ded with the CSP protein. It therefore re- 
quires a lower dose, which means it can be 
made more easily in larger quantities, says 
Adrian Hill, a vaccine expert at Oxford who 
led development of R21. 

The Serum Institute of India, one of the 
world’s biggest vaccinemakers, will manu- 
facture R21 and says it can produce more 
than 100 million doses of the vaccine per 
year starting in 2024. That would be “a 


leap forward” for efforts to fight malaria, 
Laurens says. “Having a second product 
would be fabulous.” And at an estimated 
price of $2 to $4 per dose, the new vaccine 
will likely be less expensive than RTS,S, 
which cost nearly $10 per dose in the first 
major purchase of the vaccine by UNICEF. 

The phase 3 R21 trial included chil- 
dren between the ages of 5 months and 
36 months at sites in Mali, Burkina Faso, 
Kenya, and Tanzania. Most were given 
prophylactic treatment with antimalarial 
drugs, also known as seasonal malaria 
chemoprevention, which is now standard 
in many malaria-affected regions. Children 
then received either R21 or a rabies vaccine 
as a control. At the three trial sites where 
malaria transmission is limited to the rainy 
season, the vaccine had 75% efficacy against 
clinical malaria over 18 months. At the two 
sites with year-round transmission, the ef- 
ficacy was 68% after 12 months. 

R21 efficacy was slightly higher in chil- 
dren who received their vaccines early, 
between the ages of 5 and 17 months. That 
could be a sign that the vaccine is less ef- 
fective in people who have already been ex- 
posed to malaria, Laurens speculates. If so, 
R21 might have lower efficacy in areas with 
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very high malaria incidence, where young 
infants are exposed at an early age, he says. 

The vaccine hasn’t yet been tested in 
regions that have high transmission year- 
round, the WHO recommendations note. 
Effectiveness should be monitored carefully 
in such settings during vaccine rollout, the 
agency says. No trials have compared the ef- 
ficacy of the two vaccines head to head, but 
so far R21 looks to be “as good, maybe even 
a little better, but not substantially better,” 
than RTS,S, says Robert Sauerwein, a ma- 
laria vaccine expert at Radboud University. 

For both vaccines, protection fades over 
time. In the R21 trial, children received 
three doses 4 weeks apart and a booster 
dose roughly 12 months after the third dose. 
It’s not clear how long R21’s protection 
might last, and ongoing trials are testing 
the efficacy of further boosters. Alassane 
Dicko, a malaria expert at the University of 
Science Techniques and Technologies of Ba- 
mako, who helped lead the R21 trial centers 
in Mali, says there are some indications that 
a booster every 2 years might be sufficient. 

R21 still needs to complete WHO’s so- 
called prequalification process, which as- 
sesses not only safety and efficacy, but also 
manufacturing practices and quality con- 
trol. That process is underway and is ex- 
pected to be completed in “the near future,” 
Kate O’Brien, director of WHO’s Depart- 
ment of Immunization, Vaccines and Bio- 
logicals said in the press conference. WHO’s 
experts predicted that R21 could be widely 
available for use in the second half of 2024. 

Which vaccines countries choose could de- 
pend on several things, O’Brien noted. R21 
may be less costly, at least at first, but much 
more is known about the safety and efficacy 
of RTS,S. So far, few serious side effects of 
R21 have emerged, but the phase 3 trial is too 
small to detect rare adverse events. 

At least 28 countries have plans to in- 
troduce vaccines as part of their malaria 
control programs, WHO says. The agency’s 
experts emphasized, however, that the vac- 
cine should be used in combination with 
already established malaria tools, including 
insecticide-treated bed nets, rapid diagno- 
sis and treatment, and antimalarial drugs to 
prevent infections. 

“Tt will be really important not to substi- 
tute the one intervention for the other,’ says 
WHO epidemiologist Mary Hamel, who has 
helped guide the introduction of RTS,S in 
Malawi, Kenya, and Ghana. “We don’t want 
to roll out the vaccine and pull back on bed 
nets, which are also lifesaving,” she told the 
press conference. “Adding these interven- 
tions on top of each other is where we're re- 
ally going to get high impact.” 


With reporting by Meredith Wadman. 
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Monarch butterfly is not 
endangered, new review finds 


IUCN shifts status to “vulnerable” after analysis challenged 


By Dennis Normile 


n an unusual reversal, the Interna- 

tional Union for Conservation of Nature 

(IUCN) has decided North America’s 

monarch butterfly is not “endangered.” 

Instead, the insect is only “vulnerable” 

to extinction, the group said last week— 
adding that it could lower the alarm still 
further, changing the listing to “near threat- 
ened” if an upcoming census suggests the 
population is stable or growing. 

The 27 September decision followed a re- 
searcher’s challenge to population models 
an IUCN team used to justify the endan- 
gered designation, conferred just 14 months 
ago. The team committed a “scientific injus- 
tice” by ignoring data showing monarchs 
are “doing really well,” argued ecologist 


tains. Each fall, butterflies in the larger east- 
ern population make an epic migration of 
up to 4000 kilometers from Canada and the 
northern United States to a forest in central 
Mexico, where they overwinter. Then, their 
descendants make a stepwise return to the 
north, taking up to four generations to com- 
plete the journey. The western population 
winters in Southern California along the 
Pacific coast and breeds along the Rockies. 
Researchers generally agree that the 
number of monarchs wintering in Mexico 
declined beginning in the 1990s or ear- 
lier but stabilized around 2014 at about 
55 million individuals. Many researchers 
blame the decline on two factors: logging 
in Mexico’s forests and farming in the U.S. 
and Canada that increasingly relied on 
crops modified to resist herbicides. That in- 


Each winter, monarch butterflies migrate up to 4000 kilometers to a forest in Mexico, where they overwinter. 


Andrew Davis of the University of Georgia. 
IUCN’s shift marks the latest twist in 
a scientific debate over the health of the 
showy black and orange insect. Monarchs 
are found worldwide, but the North Ameri- 
can subspecies, called the migratory mon- 
arch (Danaus plexippus plexippus), has 
become “a poster child of species conserva- 
tion because of its awesome ecology and mi- 
gration,” says ecologist Anurag Agrawal of 
Cornell University, who was not involved in 
IUCN’s assessment or the challenge. 
Migratory monarchs are split into two 
populations separated by the Rocky Moun- 


creased the use of herbicides that kill milk- 
weed, which monarch caterpillars feed on. 
A counterscenario, put forward by Davis 
and others, is that populations were anom- 
alously high decades ago thanks to large- 
scale clearing of forests for agriculture in 
the 1800s. The resulting open landscapes 
allowed milkweed and monarchs to flourish 
until the land shifted back to forest or in- 
tensive farming. Davis also notes that over- 
wintering populations have crashed in the 
past only to recover the following summer. 
In 2022, an IUCN team released an as- 
sessment that used annual counts of the 
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wintering population from 1993 to 2020, 
and two population models, to discern long- 
term trends in monarch numbers, which 
can vary year to year. One model rested on 
a “linear” method that assumed a constant 
rate of change over time. Modeling found 
that the population could have shrunk by 
22% to 72% over 10 years, meeting IUCN’s 
criteria for an endangered designation. The 
assessment also noted that the stability in 
overwintering monarch numbers seen since 
2014 made “current rates of decline less 
concerning than they were in years past.” 

Still, many scientists thought the “endan- 
gered” listing was warranted because drought 
along migration routes or cold winters could 
tip the population into an extinction spi- 
ral. “Monarch populations [are] at a level 
that most scientists suggest is not sustain- 
able,’ says Karen Oberhauser, a conservation 
biologist at the University of Wisconsin- 
Madison who was on the assessment team. 

But last week an IUCN panel ruled that 
the linear model “cannot be considered 
plausible.” Instead, it suggested a more 
complex model—published in February 
2020 in Frontiers in Ecology and Evolution 
by a team led by Wayne Thogmartin of the 
US. Geological Survey—is better. It shows 
the eastern monarch population reached an 
inflection point around 2014, with a steep 
decline giving way to a slower decline or a 
slight increase. Such results support a vul- 
nerable designation, the panel said. And if 
the upcoming winter census shows continu- 
ing stability, it said “the most plausible cat- 
egory would become near threatened.” 

IUCN’s ruling is not binding on its mem- 
ber nations, including the U.S., but it can 
influence policy. In 2020, US. officials con- 
cluded that legally designating the monarch 
as a threatened or endangered species was 
“warranted,” but not a priority. 

The new designation is drawing mixed 
reactions. Given the difficulty of forecast- 
ing future monarch populations, “the pre- 
cautionary principle would suggest keeping 
the listing at endangered, with the potential 
to uplist” later, says ecologist Leslie Ries of 
Georgetown University, who was not in- 
volved in the assessment or the challenge. 

Agrawal, who believes “the original list- 
ing of monarchs as endangered was in 
haste,” is less worried. And Davis says he 
was “actually hoping that [[UCN] would 
just list [the monarch] as ‘least concern,” 
its lowest threat level. He is considering 
submitting his own assessment to IUCN. 

In the meantime, Anna Walker, an 
entomologist at the New Mexico BioPark 
Society who led the IUCN panel that recom- 
mended the endangered listing, notes that 
the vulnerable listing “still indicates a high 
level of extinction risk.” 
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No U.S. shutdown, but dim 
outlook for science funding 


Congress faces strong pressure to hold down 2024 spending 


By Jeffrey Mervis 


.S. scientists bracing for a govern- 
ment shutdown that would have 
furloughed federal researchers and 
disrupted grantmaking are relieved 
that Congress averted a closure with 
a temporary spending measure passed 
last weekend. But Congress is still a long way 
from approving 2024 spending bills for re- 
search agencies. And scientists are likely to 
be disappointed with many final numbers. 

The so-called continuing resolution (CR) 
passed overwhelmingly on 30 September by 
both chambers allows agencies to operate 
until 17 November with spending at current 
levels. That means science agencies can con- 
tinue to accept and review grant applications 
and carry out routine business, although they 
are forbidden from starting any programs or 
reshuffling funds. 

If lawmakers can’t agree on a 2024 spend- 
ing plan by mid-November, however, they will 
need to pass another CR to avoid a shutdown. 
And many observers are betting that a con- 
tinued lack of consensus will trigger another 
legislative clash—and possible shutdown—in 
the days before the holiday season. 

For scientists, the bad news is that the 
chances are dwindling that research agencies 
will see any of the healthy budget increases 


that President Joe Biden has requested for 
the 2024 fiscal year that began on 1 October. 
And those agencies could fare much worse by 
the time the dust settles. 

One reason is that a late May deal to pre- 
vent the government from defaulting on its 
debt obligations also holds all nondefense 
discretionary spending—which includes civil- 
ian research—at this year’s levels rather than 
the 7% increase Biden has proposed. Defense 
would grow by 3%, as Biden has requested. 

But even that hard-fought compromise be- 
tween Biden and House of Representatives 
Speaker Kevin McCarthy (R-CA) is in serious 
trouble. Prodded by a group of conservative 
Republicans, the House last week passed four 
spending bills that would cut the budgets of 
several civilian agencies below 2022 levels. 
None covers such key research agencies as 
the National Institutes of Health (NIH) or the 
National Science Foundation. But the House 
is expected to approve similarly sharp cuts 
for those agencies in coming weeks. 

“I would not be surprised to see a bill 
come to the House floor that contains 
deep cuts to NIH,” says Ellie Dehoney of 
Research!America, which advocates for bio- 
medical research. 

Those low numbers won’t be the final 
word. The Senate has begun to advance its 
own versions of the 12 spending bills cover- 
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Another government shutdown looms in 6 weeks if 
Congress fails to pass a final budget. 


ing all government activities; they would give 
small increases to several research agencies. 
Those boosts are possible because the Sen- 
ate is assuming overall spending levels that 
are much higher than those supported by the 
House. The two chambers—and the White 
House—will need to find middle ground be- 
fore any of the appropriations bills can be 
passed and signed into law. 

Striking a compromise, however, will be 
complicated by McCarthy’s tenuous grip on 
his position as speaker of the House, where 
Republicans hold a slim majority. Many of 
the same group of conservative legislators 
who supported a shutdown and seek deep 
budget cuts also want to oust McCarthy as 
speaker. That leadership drama could com- 
plicate progress on spending bills. 

“Will McCarthy stick to his agreement with 
Biden or ... push for additional cuts?” asks 
one science lobbyist who requested anonym- 
ity. “I think the answer will depend on his 
strategy for holding onto his job.” 

Amid all the uncertainty, science advocates 
hail the CR as an example of what can happen 
when congressional Democrats and Republi- 
cans focus on getting something done rather 
than scoring political points. For example, 
McCarthy dropped proposed language on 
such divisive issues as abortion and immigra- 
tion to come up with a bill that Democrats 
would support. And Senate Majority Leader 
Chuck Schumer (D-NY) reluctantly removed 
an additional $6 billion to support Ukraine’s 
military to accommodate some Republicans. 
The final votes on the “clean” CR—335 to 
91in the House and 88 to nine in the Senate— 
reflect a willingness to call a truce to the 
fierce partisanship, says Matt Hourihan of 
the Federation of American Scientists. 

“As a bipartisan measure, the CR further 
demonstrates that compromise is not only 
possible, but necessary,’ Hourihan says. 
“There's a long road ahead, but I think it sets a 
good precedent that legislators can build on.” 

One early test could come if Congress tries 
to pass a separate bill with Ukraine fund- 
ing. Pundits predict that winning House 
support for this “supplemental” could re- 
quire including funds for border security 
and restrictions on immigration. Given the 
consensus on freezing civilian spending, 
however, a successful supplemental could 
make it even harder for legislators to boost 
research budgets. 

Such uncertainty makes it difficult to 
predict when Congress might ultimately 
agree on a 2024 budget. “I think the next 
45 days are going to be very fluid,” Dehoney 
says. “So I’d be wary of anybody offering up 
any predictions.” m 
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Third retraction looms for 
superconductivity physicist 


Ranga Dias'’s co-authors request retraction after doubts 
cast on key data in blockbuster room-temperature claim 


By Dan Garisto 


acing a mutiny by his co-authors, 

Ranga Dias, the University of Roches- 

ter (U of R) physicist embroiled in con- 

troversy over his superconductivity 

research and allegations of scientific 

misconduct, is set to have a third pa- 
per retracted. 

In the 8 March paper in Nature, Dias and 
his colleagues claimed to have discovered 
a material that could conduct electricity 
without resistance at room temperatures 
and moderate pressures. If the paper is 
retracted, it will follow retractions of two 
other claims from Dias’s teams: one last 
month from Physical Review Letters (PRL), 
and one in September 2022 of a different 
Nature paper. 

On 1 September, Nature attached an edi- 
tor’s note to the March paper, warning read- 
ers that “the reliability of data presented in 
this manuscript is currently in question.” A 
week later, eight of the co-authors on the 
1l-person paper submitted a letter to Na- 
ture requesting the study be retracted, The 
Wall Street Journal first reported last week. 
Science has obtained the letter and addi- 
tional documents, which raise concerns 
about the reliability of the data and Dias’s 
treatment of his co-authors. “We respect- 
fully request and recommend that Nature 
issue a retraction,’ conclude the signatories, 
who include five recent graduate students 
of Dias’s. 

The documents indicate Nature plans to 
retract the paper. In a 14 September email 
to the co-authors, Nature Senior Editor 
Tobias Rédel says a postpublication re- 
view revealed issues that are “mostly 
unaddressed.” Rédel added, “We are in ab- 
solute agreement with your request that the 
paper be retracted.” Responding to Science, 
Nature’s Chief Physical Sciences Editor Karl 
Ziemelis said the journal is “carefully” in- 
vestigating concerns. “We expect to take ac- 
tion in the near future.” 

In emails throughout the postpublication 
review process, Dias reiterated his “un- 
wavering confidence in the integrity of our 
data.” He did not respond to a request for 
comment. “There are not many new facts 


in the big picture, but the brazenness and/ 
or incompetence is remarkable,” says Peter 
Armitage, a condensed matter physicist at 
Johns Hopkins University who reviewed the 
documents obtained by Science. 

The March paper drew headlines be- 
cause of the potential applications for a 
room-temperature superconductor, such 
as lossless electrical grids and powerful 
magnets for trains and medical imaging. 
In the study, Dias and his colleagues re- 
ported that a compound of lutetium, ni- 
trogen, and hydrogen superconducts at 
temperatures up to 294 K when squeezed 
to a comparatively moderate pressure of 
10,000 atmospheres. (Dias had made earlier 


Ranga Dias is facing another retraction for his work 
on room-temperature superconductors. 


claims of creating near-room-temperature 
superconductors, but only at pressures of 
more than 1 million atmospheres.) 

Questions about the paper arose immedi- 
ately after its publication. On 2 May, two re- 
searchers submitted an anonymous critique 
of the paper to Nature. These researchers 
disclosed their identities to Science: They are 
James Hamlin, a high-pressure experimen- 
talist at the University of Florida, and Brad 
Ramshaw, an expert in superconductivity 
at Cornell University. 

Hamlin and Ramshaw focused on data 
supporting the observation of zero elec- 
trical resistance, a hallmark of super- 
conductivity. Although the paper presented 
figures showing the resistance dropping 


6 OCTOBER 2023 + VOL 382 ISSUE 6666 19 


NEWS | IN DEPTH 


to about zero, the authors acknowledged 
modifying the data by subtracting a back- 
ground signal. Background subtraction is 
sometimes done in high-pressure super- 
conductivity studies, but the practice is 
controversial because it can give the false 
appearance of zero resistance. 

In their critique, Hamlin and Ramshaw 
pointed out that the paper describes a back- 
ground subtraction method that produces 
a resistance plot drastically different from 
the published version. On 28 May, Dias 
and the other senior author on the paper, 
Ashkan Salamat, a physicist at the Univer- 
sity of Nevada, Las Vegas, provided Nature 
editors with updated instructions on the 
background subtraction method. But ap- 
plying this method still produced a figure 
different from the published plot. Essen- 
tially, they could not explain how they ar- 
rived at a plot that showed zero resistance, 
Hamlin says. “The problem 
is they’re not able to answer 
the question of how much 
they subtracted off.” 

In response to the critique, 
Nature initiated a_ post- 
publication review process, 
soliciting feedback from four 
independent experts. In doc- 
uments obtained by Science, 
all four referees expressed 
strong concerns about the 
credibility of the data. ‘ 
fail to understand why the 
authors ... are not willing 
or able to provide clear and 
timely responses,’ wrote one 
of the anonymous referees. 
“Without such responses the credibility of 
the published results are in question.” A sec- 
ond referee went further, writing: “I strongly 
recommend that the article by R. Dias and A. 
Salamat be retracted.” 

Most of the paper’s co-authors were not 
included in these initial deliberations until 
6 July, at which point Dias provided them 
with a PDF of the postpublication peer review. 
In their letter to Nature the co-authors allege 
that they had raised concerns about the study 
prior to publication, but that Dias dismissed 
them. They say Dias gave some of them an 
ultimatum: Remove their names from the 
paper or allow it to proceed. “Neither choice 
seemed tenable given that Dr. Dias was in 
control of our personal, academic, and finan- 
cial circumstances,” they write. “We did not 
feel that we were able to speak freely.” 

After seeing the postpublication review, 
the co-authors used their access to the raw 
data to corroborate Hamlin and Ramshaw’s 
concerns about the zero-resistance mea- 
surement. They found that the paper relied 
on a method to determine the resistance 
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“The students 
are courageous for 
voicing their 
concerns and doing 
the right thing, 
and the scientific 
1 community should 

protect them.” 


Shanti Deemyad, 
University of Utah 


that was not discussed or disclosed to the 
co-authors and did not match the raw data. 
They also argued that another key piece 
of data presented as evidence of super- 
conductivity, the heat capacity of the sam- 
ple, was a “flawed measurement.” 

Before the co-authors sent their letter to 
Nature, Dias sent a cease-and-desist letter 
to six of them—five of his former graduate 
students and one U of R faculty member. He 
warned them about the “potential legal con- 
sequences of your actions and to consider 
the ethical implications of making baseless 
allegations against a colleague and fellow 
scientist.” Dias has previously sent cease- 
and-desist letters to other critics of his work. 
“Raising concerns about data and processes 
is an integral part of our scientific practice,” 
says Shanti Deemyad, a condensed matter 
physicist at the University of Utah. “The 
students are courageous for voicing their 
concerns and doing the right 
thing, and the scientific com- 
munity should protect them.” 

All co-authors signed the 
letter to Nature, except Dias 
and two of his U of R stu- 
dents. The signatories also 
include Salamat, a longtime 
collaborator who with Dias 
co-founded a spin-off com- 
pany, Unearthly Materials, 
to commercialize super- 
conductors. Earlier this year, 
during PRI’s investigation 
of its now-retracted paper, 
Salamat provided what the 
journal concluded was false 
raw data in “a deliberate at- 
tempt to obstruct the investigation.” Salamat 
did not respond to requests for comment. 

Dias is also under investigation by Wash- 
ington State University for allegedly plagia- 
rizing more than 20% of his Ph.D. thesis. 
He has previously claimed to be “addressing 
these issues directly with his thesis adviser.” 
U of R had already conducted two internal 
investigations of Dias’s research that it says 
found no evidence of misconduct. However, 
a U of R spokesperson says a third investi- 
gation is now underway by external experts 
who are looking into “the integrity of data 
across multiple papers.” 

A source who requested anonymity says 
that until that investigation has concluded, 
Dias is not allowed to supervise under- 
graduate and graduate students. The U of 
R spokesperson would not confirm whether 
Dias has also been stripped of his teaching 
duties. But the university’s course website 
shows that for this fall term, there are zero 
students enrolled in Dias’s classes. 


Dan Garisto is a science journalist in New York City. 
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Astronomers 
may have 
spied second 
Kuiper belt 


New objects found beyond 
Neptune during hunt for 
targets for NASA spacecraft 


By Paul Voosen 


here just doesn’t seem to be enough of 
the Solar System. Beyond Neptune’s or- 
bit lie thousands of small icy objects in 
the Kuiper belt, with Pluto its most fa- 
mous resident. But after 50 astronomi- 
cal units (AU)—50 times the distance 
between Earth and the Sun—the belt ends 
suddenly and the number of objects drops to 
zero. Meanwhile, in other solar systems, simi- 
lar belts stretch outward across hundreds of 
AU. It’s disquieting, says Wesley Fraser, an 
astronomer at the National Research Council 
Canada. “One odd thing about the known So- 
lar System is just how bloody small we are.” 

A new discovery is challenging that pic- 
ture. While using ground-based telescopes to 
hunt for fresh targets for NASA’s New Hori- 
zons spacecraft, now past Pluto on a course 
out of the Solar System, Fraser and his col- 
leagues have made a tantalizing, though pre- 
liminary, discovery: about a dozen objects 
that lie beyond 60 AU—nearly as far from 
Pluto as Pluto is from the Sun. The finding, if 
real, could suggest that the Kuiper belt either 
extends much farther than once thought or— 
given the seeming 10-AU gap between these 
bodies and the known Kuiper belt—that a 
“second” belt exists. 

The discovery, being prepared for publica- 
tion and not yet peer reviewed, is supported 
by measurements from New Horizons itself, 
which at 57 AU continues to streak beyond 
the edge of the known Kuiper belt. Many of 
its instruments are in hibernation, but a dust 
counter has run continuously during the mis- 
sion. Dust is a telltale sign of colliding plan- 
etary bodies, and so the New Horizons team 
expected the amount of dust to fall off steeply 
after the probe left the Kuiper belt, where it 
had rendezvoused with an object called Ar- 
rokoth. Instead, “The number of impacts is 
not declining,’ says Alan Stern, the mission’s 
principal investigator and a planetary sci- 
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entist at the Southwest Research Institute. 
“And the simplest explanation for that is that 
there is more stuff out there that we haven’t 
detected.” 

Astronomers outside the team are in- 
trigued but want to see more evidence. “If 
there really is a new belt, that’s a super- 
exciting thing,” says Pedro Bernardinelli 
of the University of Washington. However, 
the findings are hard to reconcile with un- 
published results from a recent survey of the 
outer Solar System using the 4-meter Victor 
M. Blanco Telescope in Chile. It surveyed a 
different slice of sky and turned up only one 
object beyond 50 AU. Other surveys have 
also come up dry. “Why are we not seeing 
these things?” Bernardinelli asks. “Did every- 
one get unlucky? It’s possible, but it’s hard.” 

Stern would welcome more icy bodies to 
study. After threats to shift New Horizons 
from planetary science to heliophysics— 
the study of the Sun and its plasma-filled 
envelope—NASA relented last week, extend- 
ing the mission’s current 
focus until the decade’s end. 
That means the probe could 
still visit another object like 
Arrokoth, if one exists, Stern 
says. “We have the opportu- 
nity to do science that we 
know we can do—and can’t 
be done other ways—for a 
very long time.” 

For years astronomers 
have scouted targets for the 
mission using a wide-field 
camera on Japan’s 8.2-meter 
Subaru Telescope, on Ha- 
waii’s Mauna Kea. The giant 
telescope is needed, Fraser 
says, because Kuiper belt ob- 
jects, so faint and slow mov- 
ing, “are a pain in the butt 
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to find” Targets for New Horizons are even 
harder to see because the spacecraft, as seen 
from Earth, appears to be flying straight to- 
ward the bright center of the Milky Way, daz- 
zling instruments. 

To boost the signal, the team electroni- 
cally combines hundreds of images from a 
night of observation, hoping a bright blob 
will appear along a likely trajectory. Ini- 
tially, the team examined the images manu- 
ally, sifting through 15,000 candidates a 
night. “You go blind pretty quickly,’ Fraser 
says. Now, artificial intelligence makes the 
screening much less painful. “We went from 
a week of agony for an army of people to 
6 hours of vetting.” 

The result was the 12 far-out objects. 
They add to a clue: three blips spotted by 
star-tracking sensors on the Hubble Space 
Telescope in its first 20 years, says Hilke 
Schlichting, an astrophysicist at the Uni- 
versity of California, Los Angeles. The sen- 
sors, which stare unendingly at single stars 


Loosening the belt 
Since 2021, astronomers have found hundreds of new objects in the Kuiper belt, a region 
beyond Neptune. Most sit within 30 and 50 astronomical units (AU, blue). But 12 lie past 
60 AU (red)—suggesting NASA's New Horizons mission could have more objects to explore. 


A dozen distant objects spotted 
by the Subaru Telescope in 
Hawaii are challenging theorists. 


to help with pointing the telescope, briefly 
dimmed for a fraction of a second, suggest- 
ing the stars were eclipsed by unknown Kui- 
per belt objects crossing in front of them. 
“Ts there a larger population beyond 60 AU? 
There could be,’ Schlichting says. “Maybe 
that’s what we're seeing. I’m not sure.” 

The team has done its best to confirm the 
objects’ orbits, by capturing them at three 
or four spots along their apparent courses. 
They think one reason the Blanco telescope 
hasn’t turned up similar results might be 
because the objects are clustered near New 
Horizons’s path, perhaps by Neptune’s grav- 
ity, Fraser adds. 

Just as intriguing as the new objects is the 
apparent gap between 50 and 60 AU, says 
Mihaly Horanyi, a space physicist at the Uni- 
versity of Colorado Boulder who oversees 
New Horizons’s dust counter. “One way or 
another, something is responsible for main- 
taining that gap.” In other solar systems, 
planets orbiting within a dusty disk carve 
gaps by hoovering up ma- 
terial. But no large planet 
has been seen in the gap. 
The gap could also be a relic 
from the Solar System’s in- 
fancy, caused by waves of 
pressure in the disk. 

The team returned to 
Subaru for several days last 
month, using a new optical 
filter that Fraser says will let 
them see fainter and smaller 
objects. They are now ana- 
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lyzing the data. If the second 
Kuiper belt is real, more than 
a dozen new distant bodies 
should emerge, he says. “And 
if that expectation isn’t met, 
then we have completely 
missed something.” 
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Award honors pair for mRNA 
work key to COVID-19 vaccines 


Katalin Karik6 and Drew Weissman overcame a major 
roadblock to a novel immunization approach 


By Catherine Offord and Jon Cohen 


ver the past 3 years, COVID-19 has 
provided a life-and-death testing 
ground for a previously unproven vac- 
cine technology. Relying on strands 
of RNA to rouse the immune system 
against SARS-CoV-2, the approach 
resulted in shots that saved countless lives 
and protected millions from serious illness. 
Now, Katalin Karik6 and Drew Weissman 
have been awarded this year’s Nobel Prize in 
Physiology or Medicine for messenger RNA 
(mRNA) research key to the novel COVID-19 
vaccines—and their insights could lead to 
medicines for many other diseases. 

The prize, announced this week, recognizes 
the pair’s pioneering studies of mRNA, the 
genetic strands encoded by DNA to help in- 
struct cells how to assemble proteins. Work- 
ing together at the University of Pennsylvania 
in the early 2000s, the two found a way to 
overcome a major obstacle to using labmade 
mRNA in medicine: the fierce inflamma- 
tory responses it provokes in the body. They 
identified specific chemical changes to the 
molecules that tame those responses, paving 
the way for delivering mRNA into human 
cells to make viral proteins. Moderna and 
the Pfizer-BioNTech collaboration both used 
the approach to quickly make COVID-19 vac- 
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cines that began to protect people around the 
world at the end of 2020. 

“This is a wonderful choice of Nobel Prize,” 
says Anthony Fauci, former director of the 
US. National Institute of Allergy and Infec- 
tious Diseases, who had Weissman as a fellow 
in his lab. “[It’s] the classic example of a years 
and years-long collaboration on a very, very 
difficult problem. ... They doggedly kept at it 
and made a discovery that has already trans- 
formed many areas of biomedical research.” 

Traditional vaccines typically contain a 
weakened or killed version of a pathogen, 
key pathogen proteins made in the lab, or 
viruses genetically engineered to make those 
proteins in the body. In contrast, mRNA in- 
structs cells to make pathogen proteins that 
trigger protective immune responses. The 
approach promised an efficient way to make 
vaccines, as well as therapeutic proteins—if 
the potentially harmful immune responses 
could be forestalled. 

Karik6, a biochemist born in Hungary, 
and Weissman, a U.S.-born immuno- 
logist, realized that natural mRNAs, un- 
like the lab versions, have various chemical 
modifications that prevent this problem. In 
2005, they reported in the journal Jmmu- 
nity that replacing an mRNA base called 
uridine with naturally occurring, modified 
bases such as pseudouridine could greatly 
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Katalin Karik6 (left) and Drew Weissman’s long-te upe 


collaboration paid off in lifesaving vaccines. 


reduce the inflammatory response. They 
also showed that the change could increase 
the amount of protein made by cells, gen- 
erating a strong protective response. Mod- 
erna and the Pfizer-BioNTech collaboration 
used these insights to create mRNA vac- 
cines evoking immunity to SARS-CoV-2 
spike protein. (BioNTech employed Karik6é 
until recently.) 

It’s still unclear whether mRNA vaccines 
generally offer better protection than other 
proven vaccine approaches. But they can 
be faster and easier to produce. Developers 
simply synthesize the genetic sequence for 
the desired protein and encase it in a protec- 
tive lipid nanoparticle. Proteins, viruses, and 
bacteria, in contrast, need to be produced in 
bioreactors that are prone to contamination 
and poor yields. 

Kariko and Weissman struggled to find 
the resources to develop their mRNA work 
clinically. Karik6 couldn’t win funding from 
the National Institutes of Health, and she 
didn’t have a research team when she was 
doing the key studies, forcing her to do the 
biochemistry experiments with her own 
hands. “I am so unlikely a person to win the 
Nobel Prize,” she says. 

Karik6, now a professor at the Univer- 
sity of Szeged, is just the 13th woman to be 
awarded the Nobel Prize in Physiology or 
Medicine in its more than 120-year history. 
“(Her] grit and determination to follow the 
science, despite many telling her she could 
not succeed, are an inspiration to all,’ says 
Jessica Foster, an oncologist at the Children’s 
Hospital of Philadelphia. 

There’s optimism that mRNA medicines 
will move beyond preventing diseases and 
into treating them. Companies and academic 
labs are clinically testing mRNAs that pro- 
duce therapeutic proteins in the body for con- 
ditions ranging from cystic fibrosis to heart 
disease. Foster, for example, has worked with 
Kariko on mRNAs that stimulate antitumor 
immune responses. 

Olle Kampe, vice chair of the commit- 
tee that chose the winners of this year’s 
award, hopes the honor will increase use of 
COVID-19 vaccines. The Nobel imprimatur, 
he suggests, “can make hesitant people take 
the vaccine and be sure that it’s very efficient 
and it’s safe.” 

Soumya Swaminathan, who served as 
chief scientist at the World Health Organi- 
zation until 2022 and now chairs a founda- 
tion in Chennai, India, says there is no doubt 
about the long-term impact of the laure- 
ates’ work. “The success of mRNA vaccines 
against COVID has now opened up a whole 
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Sculptors of short light pulses win physics Nobel 


Researchers share prize for attosecond laser flashes that capture the motion of electrons 


By Adrian Cho and Daniel Clery 


his year’s Nobel Prize in Physics hon- 

ors three scientists who developed 

techniques to glimpse the movement 

of electrons in atoms and molecules 

using flashes of laser light lasting just 

attoseconds—billionths of a billionth 
of a second. 

Pierre Agostini of Ohio State University, 
Ferenc Krausz of the Max Planck Institute 
of Quantum Optics, and Anne L’Huillier of 
Lund University share the prize of roughly 
$1 million for experiments that “enabled the 
investigation of processes that are so rapid 
they were previously impossible to follow,” 
the Royal Swedish Academy of Sciences an- 
nounced Tuesday morning. 

“The Nobel committee made the right 
choice,’ says Mauro Nisoli, head of the atto- 
second research center at the Polytechnic 
University of Milan. “Now, this is a big topic 
with many active researchers, but these re- 
ally were the pioneers.” 

LHuillier produced such pulses first, fol- 
lowing a fortuitous discovery. In 1988, she 
and her colleagues shined an intense infra- 
red laser through a noble gas and found that 
the gas emitted ultraviolet light with frequen- 
cies that were multiples of the frequency of 
the original laser light. Over the next several 
years, LHuillier and colleagues explained 
how such “high harmonics” emerge as the la- 
ser light rips an electron from a gas atom and 
then shoves it back in, causing it to emit light. 

Those high harmonics are spread across 
a range of frequencies differing by factors of 
30 or more—just what you need to make a 
very short pulse of light. Fundamentally, any 
short pulselike wave must contain a spread 
of frequencies—and the shorter the pulse, the 
wider the spread. It’s the same reason why 
the information-packed pulses of high-speed 
internet require “broadband” wavelengths. 
In the 1990s, L’'Huillier and colleagues dem- 
onstrated that the high harmonics can be 
combined in such a way that the waves in- 
terfere with each other, producing a train of 
shorter pulses where the peaks coincide. 

Theory predicted that those pulses would 
be just attoseconds long. But physicists had 
no way to prove that until Agostini and his 
colleagues tackled the problem in 2001. He 
and his team used an argon gas to generate 
a train of ultrashort pulses and shined both 
those pulses and part of the original laser 
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pulse that spawned them onto another argon 
sample. The researchers then took advantage 
of a prediction, based on interference effects, 
that the number of electrons emerging at a 
particular energy would depend on how the 
two laser pulses overlapped in time. They 
could vary the overlap by, say, moving a mir- 
ror in their optical set up by a micrometer 
or so. Using this now-standard technique, 
Agostini and colleagues showed that the 
pulses were just 250 attoseconds long. 

Krausz pushed attosecond pulses further 
by creating solitary pulses that can be more 
useful for experiments. In 2001, he and his 
team produced a lone pulse 650 attoseconds 
long. To do that, the researchers had to first 
produce a very wide range of harmonics 
and then use a filter to select and combine 
just a few of the highest ones. 


kind of a spectacular technology looking for 
a killer app.” But Krushelnick notes that it 
also took lasers decades to fulfill their initial 
promise. 

Some physicists are working to extend 
attosecond techniques to other parts of the 
electromagnetic spectrum. Researchers have 
now coaxed attosecond pulses out of the 
Linac Coherent Light Source, an enormous 
x-ray free-electron laser at SLAC National Ac- 
celerator Laboratory, says Linda Young, an 
atomic physicist at Argonne National Labo- 
ratory. In principle, ultrashort x-ray pulses 
would have enough energy to probe atoms 
and molecules with much more strongly 
bound electrons. 

LHuillier is just the fifth woman to 
win the Nobel Prize in Physics. “L’Huillier 
was instrumental and for a long time she 


The work of Anne L’'Huillier, Ferenc Krausz, and Pierre Agostini (left to right) captures fast chemical processes. 


In the past 2 decades, applications of atto- 
second science have begun to grow. Scientists 
can measure how long it takes for a pulse to 
tug an electron away from an atom—a sign 
of how tightly it is bound—and see how 
collections of electrons move from place to 
place inside molecules or materials, key to 
understanding their properties. Researchers 
have also used attosecond pulses as ultrafast 
electronic switches, flipping a material from 
an insulator to a conductor. Attosecond light 
pulses may even help diagnose disease in 
blood samples, by giving molecules a shove 
and provoking a telltale response. 

So far the techniques have produced few 
major discoveries, says Karl Krushelnick, a 
physicist at the University of Michigan. “It’s 


didn’t get the recognition for it,” says Ursula 
Keller, head of the ultrafast laser group at 
ETH Ziirich. 

Speaking by phone during the press con- 
ference announcing the prize, LHuillier said 
there is plenty of fertile terrain left to ex- 
plore. One goal would be to use attosecond 
pulses not just to investigate the movement 
of electrons during chemical processes, but 
to control them, perhaps driving a reaction 
that nature doesn’t prefer. “The dream—if 
you want, the Holy Grail—of this part of our 
research is to be able to control the initial 
time for a molecular reaction,’ LHuillier 
said. “But this is in the future.” & 


With reporting by Phie Jacobs. 
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FAMILY TIES 


Giant family trees based on ancient DNA from thousands of people 
are revealing prehistoric politics and social structure 


he burial jar, found under the floor 

of a mountaintop citadel called La 

Almoloya in southeastern Spain, 

held a puzzle. Almost 1 meter in 

diameter, the vessel entombed 

a woman in her late 20s with 

a shining silver diadem on her 

forehead. She also had silver ear- 

plugs threaded through with silver 

hoops, an awl covered in silver—and a com- 

panion: a middle-age man laid to rest in 

the same jar with a fraction of her wealth. 

The pair were likely prominent members 

of a Bronze Age protostate called El Argar, 

which dominated much of the Iberian Pen- 

insula from hilltop strongholds for nearly 
700 years, beginning around 2200 B.C.E. 

When archaeologists first excavated these 

tombs more than a century ago and found 

many women and men buried together 

in large jars, they assumed the pairs were 

royal couples. But given the wealth gap be- 

tween women and men, later researchers 


24 6 OCTOBER 2023 + VOL 382 ISSUE 6666 


By Andrew Curry 


concluded the jars held relatives who died 
years apart. They thought the women’s rich 
adornments suggested a matriarchy, where 
powerful women were buried and their sons 
or grandsons added to their tombs later. 

Then, in 2019, a team of geneticists and 
archaeologists extracted DNA from the La 
Almoloya woman and her companion, along 
with 66 other people buried in pairs and sin- 
gly in the walls and under floors of the hilltop 
houses. Rather than ancestors and descen- 
dants, the analysis showed, the couples were 
partners. In three cases, children were buried 
near their parents, who were buried together; 
one baby girl was the daughter of the woman 
and man buried in the palace. “We did not 
expect that,’ says co-author Roberto Risch, an 
archaeologist at the Autonomous University 
of Barcelona. “But [the couples] must have 
been contemporary, because they had off- 
spring together.” 


The results also revealed that the elite 
women seemed to be transplants, perhaps 
from other high-status hilltop settlements. 
Unrelated to others buried at the site, the 
women married men who were related to 
each other and apparently local, perhaps 
as a way to knit far-flung El Argar settle- 
ments into a cohesive early state. “Probably 
women were crucial in social networking,” 
Risch says. “It’s clear the culture’s protago- 
nists are women.” 

The results from La Almoloya, published 
last year, are part of a surge of new stud- 
ies that are shifting the focus of ancient 
DNA research from genetic links between 
populations toward intimate, interpersonal 
connections. As the cost of DNA sequencing 
has plummeted, researchers have started 
to sequence genomes from many people 
at a site, revealing the structure of ancient 
communities. “It’s gotten so cheap that 
you can do whole cemeteries,’ says com- 
putational biologist Harald Ringbauer of 
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the Max Planck Institute for Evolutionary 
Anthropology (EVA). 

Combined with more _ traditional 
archaeological data, this tsunami of ancient 
DNA promises unprecedented insights into 
prehistoric family practices, group identity, 
and power. “We can infer not only biological 
kinship, but social practices,” says geneticist 
Vanessa Villalba-Mouco of EVA. “We can 
understand marriages, maybe, in the past.” 

Already, geneticists and archaeologists 
have teamed up to dissect the marriage 
patterns of Bronze Age steppe nomads and 
traced thousands of living descendants of 
enslaved women buried 200 years ago in 
a forgotten Maryland cemetery. Forthcom- 
ing ancient family trees, presented at two 
recent conferences and as preprints, prom- 
ise to reveal not just siblings and parent- 
child relationships, but distant cousins 
10 degrees removed, such as at an Iron Age 
English site where the burials of 170 rela- 
tives spanning 10 generations are grouped 
by maternal ancestry. At the International 
Society for Biomolecular Archaeology meet- 
ing last month, researchers reported trac- 
ing kinship ties among hundreds of people 
from a remote Polynesian island, showing 
that millennia-old social practices on the is- 
land shifted along with new arrivals. 

“There’s extraordinarily exciting stuff 
happening, and it’s not just geneticists who 
are excited about it,” says Australia National 
University archaeologist Catherine Frieman. 
“We're finally at a point where you can actu- 
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ally ask archaeological questions about this 
data,” like how people in the past organized 
societies and venerated ancestors. 

The field “is really blooming,’ agrees 
Zuzana Hofmanova, an EVA geneticist who 
is working on reconstructing the social struc- 
ture of medieval Moravia using the bones 
of people buried under Czech and Slovak 
churches. “We're just scratching the surface. 
There’s a lot to be done, and it’s really fun.” 


AFTER THE FIRST COMPLETE ancient human 
genome was published in 2010, demon- 
strating the power of ancient DNA tech- 
nology, a wave of studies mapped ancient 
migrations and population shifts. The work 
drew on samples widely scattered in time 
and space, because sequencing ancient 
genomes was at first so expensive that re- 
searchers often sampled just one or two 
individuals per site. Geneticists focused on 
broad patterns of similarity in the DNA, and 
archaeologists seeking fine-grained analyses 
of social structure and culture were some- 
times sidelined. “We told stories from a bio- 
logical perspective that were only indirectly 
related to actual history,’ says Joachim 
Burger, a geneticist at Johannes Guten- 
berg University Mainz. “Archaeologists 
always hated us for that, and they’re abso- 
lutely right.” 

Today, sequencing can be done at a frac- 
tion of what it cost a decade ago, and often 
yields more information per sample. Re- 
searchers can analyze well-preserved DNA 
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DNA revealed the connection between a woman 
and man buried in a jar (right) in the Bronze Age 
fortress of La Almoloya in Spain (left). 


from an ear bone for as little as a few hun- 
dred dollars—about the same price as get- 
ting a radiocarbon date. Based on shared 
stretches of DNA, studies can identify 
siblings and link grandparents to grand- 
children, tracing the chronology of a cem- 
etery in a way traditional radiocarbon dat- 
ing, accurate to several decades at most, 
cannot. “We often tell archaeologists to wait 
and do radiocarbon dating after we put to- 
gether the pedigree,’ says University of the 
Basque Country geneticist Ifigo Olalde. 

In the early days of ancient DNA, such kin- 
ship information was considered noise. As a 
Ph.D. student in 2015, Olalde recalls working 
out a way to identify first-degree relation- 
ships among ancient individuals—in order 
to weed siblings and parents out of his data 
sets. “If you want to understand how people 
move, you want as many unrelated individu- 
als as possible. A family is just the same DNA 
all over again,” Olalde says. “We weren’t par- 
ticularly interested in families.” 

He was surprised to find archaeologists 
felt differently. As ancient DNA technology 
improved, “we realized there was a lot of 
potential ... it wasn’t just father-son, or a 
couple of siblings,” he says. “It was whole 
families, over multiple generations.” Soon, 
he was partnering with archaeologists to 
spin samples originally gathered for sprawl- 
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ing population studies into papers focused 
on prehistoric kinship. 


IN THE LATE 1970S, dozens of skeletons 
were found under a 50-meter-long barrow 
called Hazleton North, built 5700 years ago 
by some of the earliest farmers in Great 
Britain—part of the same population that 
began to construct Stonehenge 700 years 
later. For decades, archaeologists wondered 
whether Hazleton and similar mounds 
were like family crypts or functioned more 
as shared shrines, collecting biologically 
unrelated members of the community. 

DNA from the tomb has now provided 
clear answers: Hazleton was a family af- 
fair. Working with a team that included 
Newcastle University archaeologist Chris 
Fowler, Olalde showed the tomb contained 
27 relatives spanning five generations, as 
well as seven unrelated people. Fifteen of 
the men in the tomb were related by de- 
scent to a single man, whereas none of the 
adult women were. Although the family 
tree included two young girls, adult women 
descended from the founder were conspic- 
uously missing (see graphic, below). 

That suggests that among these Neo- 
lithic Britons, women were buried with the 
family of their mates, not their parents—an 
echo of arrangements in the later El] Argar 
culture that opens the possibility that the 
people at Hazleton were patrilocal, too. “In 
the absence of genetics, we just couldn’t 
tell,” Fowler says. Now, “We can combine 
biological sex with what we know about 
age to see that socially this was something 
significant.” Archaeologists had speculated 
about patrilocality in past societies based 
on ethnographic evidence and clues from 
bone chemistry, but had never been able to 
conclusively demonstrate it. Fowler hopes 
studies of isotopes in the bones, which can 
serve as a geographic fingergprint, can 


Areconstruction of the 5700-year-old Hazleton North barrow in England, in which dozens of family members 
were buried over generations. 


show how those men and women moved 
during their lifetimes. 

Even though the women at Hazleton 
were unrelated, they apparently helped 
structure the extended families. The fam- 
ily’s founding father had four female part- 
ners. They were apparently from different 
generations, because a later woman in 
the tomb had children with the founder’s 
son as well as his grandson by a different 
partner, suggesting the son and grandson 
were somewhat similar in age. So the team 
guesses the founder’s four unions were se- 
quential, spread over his lifetime.) 

Over the course of a century or more, 
burials were organized according to the 
female line. Two women and their descen- 
dants were buried in the tomb’s southern 
chamber, whereas offspring of the other 
women were buried in a separate passage 
on the other side of the mound. “When 
you look at the layout ... it seems to re- 
ally matter which of those first generation 
women you were related to, and it seems 
to have been followed for five generations,” 
Fowler says. “That male lineage unifies the 
tomb, whereas female ancestors subdivide 
the community.” 


This summer, the longest and _ larg- 
est pedigree yet was published, from a 
Neolithic site in France called Gurgy ‘les 
Noisats, where nearly 100 individuals have 
been sequenced. The site predates Hazle- 
ton by 1000 years, and it provides strong 
evidence for patrilocality: Dozens of peo- 
ple over seven generations traced their de- 
scent back to one man. To date, most of the 
dozen or so prehistoric family trees from 
Neolithic and Bronze Age Europe suggest 
patrilocality. “It’s surprising how consis- 
tent it’s been,” given the variety of social 
structures seen in recent societies, says Ian 
Armit, an archaeologist at the University 
of York who wasn’t part of the Gurgy team. 

Others caution that a handful of stud- 
ies from a 5000-year period across Europe 
isn’t enough to show patrilocality was a 
rule. “We're still looking at single-site stud- 
ies that don’t allow us to draw strong con- 
clusions,” says Wolfgang Haak, a geneticist 
at EVA. 

Some DNA from Gurgy posed riddles. 
There were no half-siblings, strange at 
a time when people died early and part- 
ners could have taken another mate. The 
absence suggests an avoidance of even 


A Neolithic family affair 


The complex family tree of a Neolithic clan buried in a chambered tomb called Hazleton North shows the promise of DNA to reveal ancient relationships. The founder was 
one man; shades of blue indicate his descendants with each of four female partners. The tomb holds many of his male descendants, along with their young children of 
both sexes (smaller symbols). Adult daughters are missing, presumably buried elsewhere. One third generation man descended from the founder on both sides. 
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Hazleton burial site 


A50-meter-long earthen 
mound was built as a 
tomb 5700 years ago near 
the modern town of 
Cheltenham, England. 
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serial partners, says Ghent University ge- 
neticist Maité Rivollat. “Maybe they had 
rules we don’t know about in terms of 
who was buried there, or maybe they were 
strictly monogamous.” 

At Gurgy, the team also did a new kind 
of analysis, adapting a technique com- 
monly used by commercial ancestry test- 
ing companies to identify distant cousins. 
Called “identical by descent,” or IBD, 
analysis, it compares the frequency and 
length of identical stretches of DNA to find 
more distant relatives. Standard ancient 
DNA analysis can connect people two de- 
grees apart at most—a grandmother to her 
grandson, for example. IBD analysis allows 
geneticists to trace distant cousins, great- 
great-grandparents, or beyond. 

IBD analysis showed how women influ- 
enced Gurgy’s patrilocal community. “We 
see one woman leaves the community and 
a couple of generations later her descen- 
dants come back in,” Ringbauer says. For 
example, a woman who had no daughter 
buried in the cemetery did have maternal 
descendants buried there. The missing fe- 
male relatives must have left and borne 
offspring elsewhere; those offspring’s de- 
scendants later returned and were buried 
at Gurgy. That suggests women stayed con- 
nected to their home village or birth family 
as part of a larger regional network. 

Researchers are beginning to apply IBD 
to other sites and time periods. In a recent 
preprint, for example, Ringbauer identified 
second cousins who were buried 5000 years 
ago—and 1400 kilometers apart—in south- 
ern Russia and central Mongolia. Both men 
were members of the Afanasievo culture, 
an eastern variant of the steppe Yamnaya 
people who made a major genetic contri- 
bution to European populations around 
that time. At least one of the cousins must 
have traveled hundreds of kilometers across 
the steppe during his life, an early exam- 
ple of dramatic mobility in the space of a 
few generations. 

As ancient DNA databases grow expo- 
nentially, Ringbauer hopes access to tens of 
thousands of published genomes will create 
a sort of 23andMe for the distant past, link- 
ing prehistoric people to one another across 
time and space. “We can now screen tens of 
thousands of ancient individuals for IBD,” 
Ringbauer says. “My vision is you’d add a 
study and immediately see, ‘Oh, there’s a 
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second cousin over there. 


PERHAPS CHASTENED by past criticisms, 
geneticists are now quick to concede that 
biological relatedness is only one element 
of kinship. Ethnographic studies of recent 
societies are rich in examples of biological 
parentage taking a back seat to other 
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kinds of family relationships. They include 
matrilineal systems in which the most 
significant male relative is the mother’s 
brother and communities where father- 
hood is determined by the transfer of cattle 
to a woman’s family. “Kinship is not just 
genetically determined, it’s socially con- 
structed,” says Joanna Bruck, a University 
College Dublin archaeologist. “There has 
to be an acknowledgement that genetics 
can be read in multiple ways.” 
Archaeology can help guide those read- 
ings, as an early study of biological relat- 
edness showed. It scrutinized the genomes 
of 84 people buried in small cemeteries 


Arising tide of ancient genomes 
Thanks to better methods, as of November 

2022 geneticists have genomewide data 

on more than 10,000 individuals. Researchers 
expect many more to come. 
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up and down Germany’s narrow, fertile 
Lech Valley between 2500 B.C.E. and 1650 
B.C.E., during the technological transi- 
tion from Neolithic farming to the Bronze 
Age. Published in 2017, the results showed 
patrilocality. But many Lech Valley cem- 
eteries included both biologically related 
individuals and those with no genetic links 
to others in the graveyard. 

Material culture helped explain the 
pattern. The unrelated people had many 
fewer grave goods, says Alissa Mittnik, 
a geneticist at EVA who co-authored the 
study. “The farmsteads were probably run 
by families who accumulated wealth. The 
others might have been farm workers, ser- 
vants, or enslaved people.” Those people 
did not establish families in one place 
across generations, but were buried in the 
plots of the presumably wealthier land- 
owners, who did. 

To puzzle out such community relation- 
ships, geneticists and archaeologists are 
forging stronger ties. For example, when 
Ludwig Maximilian University of Munich 
archaeologist Philipp Stockhammer, a 


lead author on the Lech Valley paper, was 
awarded a European Research Council 
grant to look for kinship patterns in Bronze 
Age Greece and the Aegean, his first step 
was to reach out to Greek archaeologists, 
asking for sites and questions where DNA 
could contribute. “Kinship analysis forces 
every lab to work closer with local archaeo- 
logists,’ Stockhammer says. “You can’t do 
without archaeologists anymore.” 

Earlier this year, Stockhammer’s team 
found that one-third of its samples from 
another Aegean project had long stretches 
of shared DNA, indicating they were the 
offspring of first cousins. That rate of 
close-kin interbreeding is highest docu- 
mented anywhere in the past, although 
some modern societies have similarly high 
levels of cousin marriage. 

Although the practice was slightly more 
frequent on isolated islands, it was seen 
across the Aegean, suggesting it was the 
result of a social practice or choice, rather 
than a lack of potential partners. 

“Why do you have long-distance mobil- 
ity of marriage partners in Central Europe 
in the Bronze Age, but not in Greece?” 
Stockhammer asks. Culture may hold an 
answer: The Bronze Age people of the Ae- 
gean grew olives and grapes. Those crops 
take decades to establish, offering an incen- 
tive to keep land in the family. “This kind of 
data lets us see how certain practices com- 
pare across regions,’ Stockhammer says. 
“The future of archaeogenetics is going to 
be zooming in, and then zooming out.” 

Paleogeneticists also hope to broaden 
their work beyond Europe, where research 
has flourished thanks to adequate funding, 
access to samples, and good preservation 
in the continent’s relatively cool caves and 
burials. (See p. 53 for a review of research 
outside of Europe.) The work could also 
inform modern medicine, if researchers 
can identify a genetically inherited disease 
and trace it back generations. “Pedigree 
studies on modern individuals have been 
useful in tracing rare variants and how 
they’re inherited,” Hofmanova says. “Nine 
generation pedigrees from the past—and 
soon longer—will enable you to trace rare 
variants much further than we can in 
modern individuals.” 

In 2008, Haak was one of the first to ap- 
ply ancient DNA to identify parents and 
children in a central German grave. Fif- 
teen years later, the scope of information 
that DNA can reveal about life in the past 
is rapidly widening. “The question when 
you approach every grave is: Who are they? 
Are they related? Does proximity in space 
mean kinship?” Haak says. “Getting closer 
to answering some of those questions is a 
long-held dream that came true.” 
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The risks of radioactive wastewater release 


The wastewater release from the Fukushima Daiichi nuclear plant is expected to have 
negligible effects on people and the ocean 


By Jim Smith’, Nigel Marks?, Tony Irwin? 


n 2011, the east coast of Japan suffered 
an earthquake and tsunami that resulted 
in the meltdown of three of the reac- 
tors at the Fukushima Daiichi nuclear 
plant. This led to an uncontrolled re- 
lease of large amounts of radioactive 
material to the surrounding land and to the 
Pacific Ocean. More than 10 years later, new 
releases of radioactive wastewater to the 
Pacific Ocean from the Fukushima plant have 
started. The historical ocean contamination 
from Fukushima included large quantities 
of long-lived radioactive cesium-137 ('°’Cs), 
so there is widespread concern over these 
new wastewater discharges. However, these 
releases will result in much lower total radia- 
tion doses to people and aquatic ecosystems 
than the contamination caused by the acci- 
dent (J—3). Furthermore, aquatic ecosystems, 
including those around the Chernobyl disas- 
ter site, have been shown to be remarkably 
resilient to radiation (4-7). Thus, the new 
Fukushima releases are not expected to cause 
substantial effects on seafood consumers or 
the marine ecosystem. 
The highest-activity radioactive contami- 
nant in the Fukushima wastewater is tritium 
(H), in the form of tritiated water (HTO). 


1University of Portsmouth, Portsmouth, UK. Curtin 
University, Perth, Australia. ?Australian National University, 
Canberra, Australia. Email: jim.smith@port.ac.uk 


SCIENCE science.org 


This molecule cannot be separated from the 
wastewater because its chemical behavior 
is the same as that of nonradioactive water. 
Like other radionuclides [such as natural car- 
bon-14 (*C) and anthropogenic ’Cs], which 
emit high-energy y rays and B particles when 
they decay, tritium can have biological effects 
on organisms, particularly DNA damage (8). 
But tritium’s radiotoxicity by ingestion is very 
low compared to that of these other radionu- 
clides owing to its very weak B emission and 
relatively short retention time in the body. 
Therefore, it is common practice for nuclear 
facilities worldwide to discharge wastewater 
containing HTO into the sea (see the figure). 

The La Hague nuclear facility in France 
annually discharges ~10,000 terabecquerels 
(1 TBq = 10” Bq; the becquerel is an amount 
of a radioactive element that produces one 
decay per second) of HTO into the English 
Channel, with annual discharges during 
1996-2016 ranging between 8000 and 12,000 
TBq (9). Radiation doses to people (measured 
in sieverts and defined as the amount of en- 
ergy deposited in tissue, taking account of the 
response of human biology to different radia- 
tion types) from the release of HTO from La 
Hague are low [<0.01 microsieverts per year 
(uSv/year) (9), which compares with the 1000 
uSv/year recommended limit for members of 
the public from nuclear site releases (2)], and 
no environmental impacts have been found 
or are expected. For example, studies of fish 


(fathead minnow, Pimephales promelas) ex- 
posed to levels of HTO between 7700 and 
21,900 Baq/liter in a river near a Canadian 
nuclear site (compared with <50 Bq/liter in 
seawater near La Hague) have indicated that 
it can induce DNA damage (as measured by 
comet assay), but no effects were observed 
on the survival of fish or various indices of 
health (70). 

At Fukushima, 22 TBq of HTO per year will 
be released (J, 2), which is a factor of ~450 
less than that from La Hague per year. The 
planned Fukushima discharge annual limit 
is the same as when the facility was an op- 
erational electricity-generating boiling water 
reactor (BWR). Less tritium is produced in a 
BWR than in other types of nuclear reactor, 
so the limit was originally set to be low com- 
pared with that of some other reactor types. 
The very low release limit for the Fukushima 
wastewater was selected by the Japanese gov- 
ernment owing to societal concerns such as 
the reputational impact on the local fishing 
industry. The planned releases will therefore 
take much longer, ~30 years, than strictly 
necessary according to current radiation 
safety recommendations. 

There will be radionuclides other than tri- 
tium in the Fukushima discharge. The waste- 
water has been treated by passing it through 
a series of columns in which ion-exchange 
chemical reactions remove most of the ra- 
dioactive elements that the wastewater con- 
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tains. It will be treated again before release 
if radiation levels are higher than the dis- 
charge limits. But tiny amounts of 30 other 
radionuclides remain after treatment, in- 
cluding “C, Co, °°Sr, ¥’Cs, and Pu isotopes 
C, 2). In this regard, the Fukushima release 
is no different from routine releases from 
other nuclear sites around the world. For 
example, in 2019, the Sellafield nuclear site 
in the UK discharged 19 times more HTO, 
1600 times more “C, 320 times more °°Co, 
180 times more I, and over 1000 times 
more °°Sr, ®’Cs, and Pu isotopes (J, 77) than 
is planned annually for the Fukushima re- 
lease. In 2019, increased radiation doses to 
seafood consumers near Sellafield were low 
and mainly due to historical discharges of 
natural radioactivity from a nearby non- 
nuclear site that produced phosphoric acid 
from uranium-rich phosphate ores, with 
the nuclear site contributing about a third 
of the dose of the phosphoric acid manu- 
facturer (11). 

During discharge, the Fukushima waste- 
water is diluted 100 times so that tri- 
tium levels will be 2.5% of the Japanese 
Government regulatory limit and the sum 
of the 30 other relevant radionuclides will 
be less than 1% of the limits. The planned 
maximum of 1500 Bq/liter of tritium in the 
discharge water is ~14% of the 10,000 Bq/ 
liter that the World Health Organisation 
designates as the drinking water limit for 
tritium (12). 

Monitoring will take place to ensure that 
the levels of radiation in the Fukushima 
releases are below the regulatory limits (J, 
2). This includes analyses of the wastewater 
for all 30 radionuclides present, including 
tritium, total a and B radiation, and the en- 
ergy spectrum of y-emitting radionuclides. 
The process will be independently verified 
by the International Atomic Energy Agency 
(IAEA) by measuring wastewater, seawater, 
and aquatic organisms near the 1-km-long 
offshore discharge pipeline during the entire 
~30-year discharge period (2). Since the first 
release began on 24 August 2023, publicly 
available information from Tokyo Electric 
Power Company (TEPCO), the IAEA and the 
Japanese Ministry for Agriculture, Fisheries 
and Food, reveals that tritium levels are neg- 
ligible within 3 km of the site. Of 251 seawa- 
ter measurements taken by 13 September at 
10 sites within 3 km of the Fukushima plant, 
only one (10 Bq/1) is above the detection limit 
of ~8 Bq/l. No detectable tritium has been 
found in 25 fish at two sites 4 to 5 km North 
and South of the discharge point (limit of 
detection ~8 Bq/kg). Tritium is likely to be 
detectable by more sensitive methods and 
when releases are higher than that of the 
first batch release (~200 Baq/1 of tritium). 

Concerns have been expressed about bio- 


32 6 OCTOBER 2023 * VOL 382 ISSUE 6666 


Tritium from the Fukushima Daiichi releases in context 

Radioisotopes, both naturally occurring and anthropogenic, exhibit different levels of radiotoxicity [based on 
(15)]. Tritium has a small dose coefficient mainly because its radiation is weak. In the Pacific Ocean, tritium 
contributes far less total radioactivity than naturally occurring radioisotopes. The planned annual release of 
tritium from Fukushima Daiichi is much lower than many releases from elsewhere, and the predicted annual 
radiation doses to local seafood consumers are much smaller than other sources of radiation. 
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magnification—the increased concentration 
of pollutants in organisms—of tritium and 
its potential detrimental effects on marine 
life, but these concerns do not correctly 
reflect the risk. If, as planned, tritium is 
released in the form of HTO, it cannot bio- 
magnify because its biological uptake and 
distribution follow that of the greater mass 
of water (13), which does not biomagnify. 
This lack of biomagnification is due to the 
chemical similarity of HTO and H,0, which 
means that the °H/'H ratio remains essen- 
tially unchanged in chemical and biological 


processes (13). Indeed, this is the very reason 
that HTO cannot be separated from ordinary 
water by Fukushima’s water treatment facil- 
ity. Tritium can be retained longer in organic 
molecules (and thus in organisms) as organi- 
cally bound tritium (OBT) (8) when HTO is 
digested by aquatic organisms. But even as- 
suming long exposure times, the radiation 
dose rates are vanishingly small. 

Radiation doses of tritium and the 
30 other radionuclides in the treated 
Fukushima wastewater have been calculated 
by independent researchers (3) and the IAEA 
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(2) using models that include dispersion in 
water, biomagnification, the different ra- 
diotoxicities, and longer retention times in 
organisms. These calculations result in very 
low dose rates (measured in energy depos- 
ited in tissue, Gy) to organisms from all ra- 
dionuclides in the release: <0.005 wGy/hour 
(3) and <0.0001 wGy/hour (2). These levels 
are below the 40 wGy/hour dose rate that is 
considered to be the limit at which no eco- 
system effects from damage to animal and 
plant tissues occur (/4). 

Aquatic ecosystems are surprisingly resil- 
ient to radioactive pollution because of the 
relatively high doses that are required to 
damage an organism’s development and re- 
production. Studies in lakes near Chernobyl 
and Fukushima found no effect of radiation 
on aquatic invertebrate abundance and di- 
versity (7), or on developmental and genetic 
indicators (4, 5). In perch (Perca fluviatilis), 
a slight delay in maturation of female go- 
nads was associated with radiation levels in 
lakes at Chernobyl, but no effects were seen 
in roach (Rutilus rutilus), and fish condition 
indices were no different from those of un- 
contaminated lakes (6). Moreover, the fish 
population in the most contaminated lakes 
is diverse (6). Radiation dose rates in wa- 
terbodies at Chernobyl are more than 1000 
times higher than those expected from the 
Fukushima discharges (3, 6). 

Radiation doses from the radioactive iso- 
topes in the planned Fukushima releases to 
people, through all potential pathways in- 
cluding seafood consumption and bathing, 
are calculated to be less than 1 pSv/year (2, 
3). This is thousands of times smaller than 
the 2400 uSv/year global average radiation 
dose from natural background radiation. 
Risks from natural radiation primarily re- 
sult from DNA damage by internal expo- 
sure to a and B radiation and to y radiation 
originating inside or outside the body. So, 
the mode of action of naturally occurring 
radiation can be compared to that of the ra- 
dioactivity from Fukushima. Natural radia- 
tion doses vary widely—millions of people 
worldwide receive doses >10,000 [Sv/year. 
Current radiation protection models as- 
sume that any extra dose could potentially 
cause cancer, because all ionizing radiation 
could damage DNA. But a radiation dose of 
less than 1 uSv/year from the Fukushima 
release is not expected to have any public 
health consequence compared with, for ex- 
ample, natural radiation. 

The impact of the Fukushima releases on 
the Pacific Ocean can also be understood by 
considering radioactivity in the natural envi- 
ronment. There is ~860 TBq of tritium in the 
Fukushima storage tanks, with 22 TBq dis- 
charged to the Pacific Ocean every year. The 
Pacific Ocean currently contains 500,000 
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TBq of natural and 2,500,000 TBq of an- 
thropogenic tritium, the latter almost wholly 
from past atmospheric nuclear weapons test- 
ing. Furthermore, all tritium contributes just 
0.04% to the total radioactivity of 8 billion 
TBq in the Pacific Ocean, where most radio- 
activity is due to naturally occurring potas- 
sium-40 (91%) and rubidium-87 (8.6%). It is 
important to note that there has long been 
awareness of radioactivity in seawater, but 
this has not been considered a health hazard. 

Fears of radiation are likely to damage 
the livelihoods of Fukushima’s fishing com- 
munity, who are still recovering from fishing 
bans and reputational damage caused by the 
2011 accident. To protect marine environ- 
ments in the best possible way, resources 
and attention should be focused on key 
stressors including climate change, overfish- 
ing, and plastic pollution. The radiation pro- 
tection science is clear that the Fukushima 
wastewater release presents no real threat 
to the organisms of the Pacific Ocean or to 
Fukushima’s seafood consumers if carried 
out as planned. Any substantial deviation 
from the release plans would quickly be no- 
ticed by monitoring. Governments and re- 
searchers from elsewhere are also expected 
to closely monitor radioactivity in the Pacific 
Ocean during the release. The planned re- 
lease at Fukushima is likely to be the most 
closely monitored wastewater discharge 
from a nuclear site. 
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Pregnancy 
programs 
the brain for 
mothering 


Progesterone and estrogen 
exert separate effects on 
the brains of pregnant mice 


By Margaret M. McCarthy 


regnancy requires extensive physi- 

ological remodeling to provide the 

energy required for gestating a fetus 

and delivering and nurturing a new- 

born. Successful mothering requires 

an additional finely tuned behavioral 
repertoire. In most mammals, this starts 
with retrieval to a nest, grooming, and nurs- 
ing. Estradiol and progesterone, the same 
pregnancy hormones that remodel the body, 
also program the brain for mothering, but 
precisely how this occurs has been opaque. 
On page 76 of this issue, Ammari et al. (1) re- 
port the identification of galanin neurons in 
the medial preoptic area (MPOA) as the cel- 
lular target of steroid programming in mice. 
Estradiol and progesterone changed the ac- 
tivity of galanin neurons such that the sen- 
sitivity to stimuli emanating from the pups 
(e.g., odors and vocalizations) was much 
stronger. New synapses were also formed 
on the galanin neurons. These remarkably 
specific changes were sufficient to promote 
mothering behavior. 

Steroid hormones, such as estradiol and 
progesterone, are long-distance signaling 
molecules that are produced by endocrine 
organs (gonads, adrenals, and the placenta) 
and coordinate tissue responses across the 
body. Successful pregnancy and parturition 
require the orchestration of tissue remodel- 
ing of the mammary gland, uterus, and cer- 
vix along with construction of a placenta—all 
of which are responses to steroid hormones. 
However, the new mother also needs to re- 
spond to her newborns by providing pro- 
tection, warmth, grooming, and nutrients 
through lactation. An absence of or a delay in 
this response can be catastrophic for species 
such as rats and mice, in which newborns 
have no capacity for thermoregulation or 
mobility. Synthesis of estradiol and proges- 
terone is up-regulated during pregnancy, and 
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(2) using models that include dispersion in 
water, biomagnification, the different ra- 
diotoxicities, and longer retention times in 
organisms. These calculations result in very 
low dose rates (measured in energy depos- 
ited in tissue, Gy) to organisms from all ra- 
dionuclides in the release: <0.005 wGy/hour 
(3) and <0.0001 wGy/hour (2). These levels 
are below the 40 wGy/hour dose rate that is 
considered to be the limit at which no eco- 
system effects from damage to animal and 
plant tissues occur (/4). 

Aquatic ecosystems are surprisingly resil- 
ient to radioactive pollution because of the 
relatively high doses that are required to 
damage an organism’s development and re- 
production. Studies in lakes near Chernobyl 
and Fukushima found no effect of radiation 
on aquatic invertebrate abundance and di- 
versity (7), or on developmental and genetic 
indicators (4, 5). In perch (Perca fluviatilis), 
a slight delay in maturation of female go- 
nads was associated with radiation levels in 
lakes at Chernobyl, but no effects were seen 
in roach (Rutilus rutilus), and fish condition 
indices were no different from those of un- 
contaminated lakes (6). Moreover, the fish 
population in the most contaminated lakes 
is diverse (6). Radiation dose rates in wa- 
terbodies at Chernobyl are more than 1000 
times higher than those expected from the 
Fukushima discharges (3, 6). 

Radiation doses from the radioactive iso- 
topes in the planned Fukushima releases to 
people, through all potential pathways in- 
cluding seafood consumption and bathing, 
are calculated to be less than 1 pSv/year (2, 
3). This is thousands of times smaller than 
the 2400 uSv/year global average radiation 
dose from natural background radiation. 
Risks from natural radiation primarily re- 
sult from DNA damage by internal expo- 
sure to a and B radiation and to y radiation 
originating inside or outside the body. So, 
the mode of action of naturally occurring 
radiation can be compared to that of the ra- 
dioactivity from Fukushima. Natural radia- 
tion doses vary widely—millions of people 
worldwide receive doses >10,000 [Sv/year. 
Current radiation protection models as- 
sume that any extra dose could potentially 
cause cancer, because all ionizing radiation 
could damage DNA. But a radiation dose of 
less than 1 uSv/year from the Fukushima 
release is not expected to have any public 
health consequence compared with, for ex- 
ample, natural radiation. 

The impact of the Fukushima releases on 
the Pacific Ocean can also be understood by 
considering radioactivity in the natural envi- 
ronment. There is ~860 TBq of tritium in the 
Fukushima storage tanks, with 22 TBq dis- 
charged to the Pacific Ocean every year. The 
Pacific Ocean currently contains 500,000 
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TBq of natural and 2,500,000 TBq of an- 
thropogenic tritium, the latter almost wholly 
from past atmospheric nuclear weapons test- 
ing. Furthermore, all tritium contributes just 
0.04% to the total radioactivity of 8 billion 
TBq in the Pacific Ocean, where most radio- 
activity is due to naturally occurring potas- 
sium-40 (91%) and rubidium-87 (8.6%). It is 
important to note that there has long been 
awareness of radioactivity in seawater, but 
this has not been considered a health hazard. 

Fears of radiation are likely to damage 
the livelihoods of Fukushima’s fishing com- 
munity, who are still recovering from fishing 
bans and reputational damage caused by the 
2011 accident. To protect marine environ- 
ments in the best possible way, resources 
and attention should be focused on key 
stressors including climate change, overfish- 
ing, and plastic pollution. The radiation pro- 
tection science is clear that the Fukushima 
wastewater release presents no real threat 
to the organisms of the Pacific Ocean or to 
Fukushima’s seafood consumers if carried 
out as planned. Any substantial deviation 
from the release plans would quickly be no- 
ticed by monitoring. Governments and re- 
searchers from elsewhere are also expected 
to closely monitor radioactivity in the Pacific 
Ocean during the release. The planned re- 
lease at Fukushima is likely to be the most 
closely monitored wastewater discharge 
from a nuclear site. 
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programs 
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the brains of pregnant mice 
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regnancy requires extensive physi- 

ological remodeling to provide the 

energy required for gestating a fetus 

and delivering and nurturing a new- 

born. Successful mothering requires 

an additional finely tuned behavioral 
repertoire. In most mammals, this starts 
with retrieval to a nest, grooming, and nurs- 
ing. Estradiol and progesterone, the same 
pregnancy hormones that remodel the body, 
also program the brain for mothering, but 
precisely how this occurs has been opaque. 
On page 76 of this issue, Ammari et al. (1) re- 
port the identification of galanin neurons in 
the medial preoptic area (MPOA) as the cel- 
lular target of steroid programming in mice. 
Estradiol and progesterone changed the ac- 
tivity of galanin neurons such that the sen- 
sitivity to stimuli emanating from the pups 
(e.g., odors and vocalizations) was much 
stronger. New synapses were also formed 
on the galanin neurons. These remarkably 
specific changes were sufficient to promote 
mothering behavior. 

Steroid hormones, such as estradiol and 
progesterone, are long-distance signaling 
molecules that are produced by endocrine 
organs (gonads, adrenals, and the placenta) 
and coordinate tissue responses across the 
body. Successful pregnancy and parturition 
require the orchestration of tissue remodel- 
ing of the mammary gland, uterus, and cer- 
vix along with construction of a placenta—all 
of which are responses to steroid hormones. 
However, the new mother also needs to re- 
spond to her newborns by providing pro- 
tection, warmth, grooming, and nutrients 
through lactation. An absence of or a delay in 
this response can be catastrophic for species 
such as rats and mice, in which newborns 
have no capacity for thermoregulation or 
mobility. Synthesis of estradiol and proges- 
terone is up-regulated during pregnancy, and 
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these hormones act syner- 
gistically to promote repro- 
ductive behavior (mating 
and parenting). However, 
connecting steroid action to 
neuronal activity and behav- 
ioral modification has been 
challenging. 

Steroids exert their ef- 
fects, which can be long- 
lasting, through changes in 
gene transcription. To do 
this, estradiol binds to es- 
trogen receptor 1 (ESRI), 
and progesterone binds to 
progesterone receptor (PR). 
ESR1 and PR are expressed 
at high concentrations in the 
MPOA, and the region plays 
an important role in control- 
ling and coordinating sex 
differences in behavior, most 
notably mating and parent- 
ing (2). Neurons, astrocytes, 
and microglia of the MPOA 
exhibit different proper- 
ties in males and females, 
including differences in 
morphology, synaptic density, and transcrip- 
tomic profile (3). Most neurons in the region 
are inhibitory—using y-aminobutyric acid 
(GABA) as a neurotransmitter—and express 
ESR1. Galanin neurons are key regulators of 
maternal behavior (4), but they constitute 
only ~20% of ESR1-expressing neurons in the 
MPOA, and whether ESR1 and PR activation 
is required for the effects of galanin neurons 
on behavior was not known. 

Ammari et al. generated a mouse model in 
which Es71 and Pgr (which encodes PR) could 
be selectively ablated in MPOA galanin neu- 
rons. They did this in virgin mice and then 
allowed half of them to mate. The females 
who had pups showed a complete loss of nor- 
mal nurturing behavior, with no retrieval to 
the nest or attempts at nursing. The authors 
determined that activation of ESR1 during 
pregnancy tuned the response of galanin 
neurons to pup stimuli by lowering the mem- 
brane potential and silencing most galanin 
neurons while selectively increasing the ex- 
citation of others, thus enhancing the signal- 
to-noise ratio. This fine-tuning faded as the 
pups matured. Activation of PR exerted a dif- 
ferent effect by inducing excitatory synapse 
formation on the galanin neurons (see the 
figure). The identity of the neurons making 
new synapses onto the galanin neurons was 
not addressed. Unlike the transient nature of 
the ESR1-mediated effect, the PR-mediated 
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Mothering 


Hormones of pregnancy promote maternal behavior 
Estradiol and progesterone levels increase during pregnancy and exert independent but 
coordinated effects on galanin neurons in the preoptic area of mice. Estradiol silences 
some galanin neurons and promotes the activity of others to increase the strength and 
selectivity of responses to stimuli emanating from newborn pups. Progesterone promotes 
the formation of new excitatory inputs, which endure past the time of immediate need for 
nurturing and thereby permanently change the behavior of female mice toward newborns. 


Silenced activity 


Estradiol 


changes endured and promoted nurturing 
behavior long after the original offspring 
were weaned. This observation may explain 
why females of many species behave mater- 
nally toward offspring that are not their own 
once they have experienced motherhood. 
One of the important advances but also a 
remaining mystery from the work by Ammari 
et al. is that loss of Esr] or Pgr expression in 
the galanin neurons was sufficient to block 
the induction of maternal behavior by preg- 
nancy. This was unexpected given the broad 
hormonal milieu of pregnancy, which in- 
cludes increased oxytocin and prolactin, and 
because many other neuronal types express 
both steroid receptors. It may be that some 
component of pregnancy creates this speci- 
ficity—a hypothesis that can be tested by in- 
ducing maternal behavior in virgins by other 
means (5). Regardless, the specificity is es- 
sential to avoid steroids promoting behaviors 
that are incompatible with parenting, such as 
mating or aggression (6). Future studies ex- 
ploring the characteristics of neurons not re- 
sponsive to the conditions of pregnancy will 
provide further information on the neural 
source of hormonally modulated behaviors. 
Moreover, deeper interrogation of the ESR1- 
and PR-responsive galanin neurons will help 
inform understanding of other aspects of 
mothering, including why it sometimes fails, 
resulting in females either ignoring or being 
aggressive toward pups. Equally important 
is how nutrients, stress, and substances of 
abuse affect the neural response to pregnancy 
and what epigenetic modifications might oc- 
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cur in these neurons. 

Although mothering is 
arguably the fulcrum of re- 
productive success, a small 
percentage of mammalian 
species, including humans, 
benefit from fathering. 
Because fathers do not ex- 
perience the hormonal mi- 
lieu of pregnancy, it is not 
always clear how fathering 
behavior is induced. A study 
of more than 600 married 
men revealed a precipitous 
drop in circulating testos- 
SS terone when they became 
fathers, and the more en- 
gaged that they were with 
their children, the lower 
the testosterone dropped, 
particularly during the first 
month after birth (7). By the 
time the child was 1 year 
old, testosterone levels were 
close to those of prefather- 
hood. Paternal behavior 
is much less studied than 
mothering in rodent mod- 
els, but the findings of Ammari et al. pro- 
vide information on where to begin, starting 
with determining if galanin neurons of the 
MPOA are activated during, and required 
for, fathering. If so, the next question would 
be whether testosterone fine-tunes the excit- 
ability of galanin neurons similar to estra- 
diol and progesterone, which would suggest 
a common neural underpinning of parent- 
ing behavior. If the mechanism of father- 
ing is different, it would be an example of 
compensation in which males overcome the 
limitation of not experiencing pregnancy 
and parturition through the creation of a 
separate neural process (8). The study by 
Ammari et al. also offers insights into how 
steroid signaling may be transduced to 
regulate competing behavioral repertoires, 
such as aggression, mating, and parenting 
by activating only a small subset of hormon- 
ally responsive neurons among the larger 
population. 
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CHEMISTRY 


An oscillating reaction to produce clean fuels 


Understanding chemical reaction mechanisms could help synthesize sustainable fuels 


By Hans Niemantsverdriet and 
Kees-Jan Weststrate 


scillating chemical reactions, in which 

reactions “flip-flop” reversibly, abound 

in living matter but are rare in the 

world of industrial chemistry. An oscil- 

lating reaction in solution, sometimes 

used for live demonstrations in chem- 
istry classes, is the Belouzhov-Zhabotinsky 
reaction, in which color-changing patterns 
spread over a Petri dish (7). Oscillations may 
occur in chemical reactors 
that contain solid catalysts 
but are often elusive be- 
cause they need to be syn- 
chronized over the entire 
volume to be observable 
(2). On page 99 of this is- 
sue, Zhang et al. (3) report 
previously unseen self-sus- 
tained oscillations of the 
Fischer-Tropsch synthesis 
(FTS), a complex reaction 
that produces hydrocar- 
bon fuels from syngas [a 
mixture of carbon mon- 
oxide (CO) and hydrogen 
(H,)]. The authors used a 
flow reactor packed with 
a cobalt catalyst and a 
cerium oxide promoter. 
Understanding why such 
oscillations arise may im- 
prove mechanistic insight 
of this process that fea- 
tures in future scenarios 
for developing sustain- 
able fuels. 

FTS is the reaction be- 
tween CO and H, gases to 
form hydrocarbons, using 
cobalt or iron catalysts at 
moderate pressure (20 to 
40 bar), and temperatures 
between 220° and 350°C (2). Discovered in 
1925 and used in World War II to convert coal 
through syngas to hydrocarbon fuels for mili- 
tary applications, FTS is currently applied to 
convert natural gas and coal into ultraclean 
transportation fuels for cars and airplanes 
and into a versatile feedstock for the fabri- 
cation of more complex chemicals. FTS fea- 
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CO dissociation: Direct or assisted by hydrogen 


tures prominently in future energy scenarios 
for storing electricity and for the production 
of sustainable aviation fuels, using captured 
CO, and “green” H, from water electrolysis 
with solar or wind power. 

Mechanistically, FTS is a sequence of many 
elementary reactions on the surface of the 
catalyst, starting with the dissociation of CO 
and H,, which form a monomer that polym- 
erizes into paraffins, olefins, and oxygenated 
products. There are two main hypotheses on 
how the reaction proceeds (see the figure) 


Elementary Fischer-Tropsch reactions on a cobalt catalyst 
Carbon monoxide (CO) dissociates directly or is assisted by H atoms on the cobalt catalyst 
containing a cerium oxide promoter. Hydrocarbon chains grow by addition of methylidyne 
(CH) monomers or by insertion of CO followed by hydrogenation and desorption from the 
catalyst surface. The reaction rates and the hydrocarbon concentrations oscillate with time 
and according to temperature. 


Direct 


H-assisted 


H-assisted 


+H 


Hydrocarbons grow by CH or CO insertion 


(4). The first assumes that CO dissociates 
at atomic steps on the cobalt surface, after 
which C and H react to form hydrocarbons, 
and O atoms hydrogenate to water, which 
detaches from the catalyst. Water formation 
is the rate-determining step (the activation 
energy is 120 to 130 kJ/mol), whereas the 
direct splitting of the C-O bond requires 
considerably less energy (~100 kJ/mol) (5). 
Subsequent reactions to form hydrocarbons 
include the repeated coupling of CH mono- 
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mers with alkylidyne (-CH-C_H,,,,) frag- 
ments, which is promoted by co-adsorbed 
CO, and proceeds with low activation barri- 
ers of 60 to 70 kJ/mol (6). Desorption of the 
hydrocarbon occurs at a higher activation 
energy of 80 to 100 kJ/mol, ensuring that hy- 
drocarbon chain growth is favorable. The sec- 
ond hypothesis assumes that H atoms assist 
in breaking the C-O bond through a CHO(H) 
intermediate (4). Hydrocarbon chain growth 
is thought to occur from CO insertion into 
the growing chain, whereas O rejection hap- 
pens later in the reaction 
sequence. Such CO inser- 
tion mechanisms account 
for the formation of, for 
example, alcohols and ke- 
tones, which form in small 
but detectable amounts. 
Computational modeling 
supports the feasibility of 
such routes (4), but experi- 
mental proof has not been 
reported so far. 

Zhang et al. demon- 
strate that the intricate set 
of surface reactions con- 
stituting the FTS exhibit 
self-sustained oscillations 
in the concentration of re- 
actants and products that 
go on for several hours, 
provided that the cobalt 


catalyst contains cerium 
6 CH,CH=CH, Olefins oxide as a promoter (a 
a CH,CH,CH, Paraffins material that facilitates a 
reaction without being a 
Se catalyst). | Concentration 
oscillations occur between 

— 
CH;CH,=0 Aldehydes 200° and 260°C and 1 atm, 
+5H CH3CH,CH,OH Alcohols implying that the hydro- 
carbon products are small 
_s +3H CH=CH, Olefins and in the gas phase. The 
+5H CH;CH, Paraffins authors present a kinetic 


model in which CO is pro- 
posed to react with an alkoxy (R-O-) adsor- 
bate to form a carboxylate key intermediate 
(R-COO) followed by partial hydrogenation 
to produce either an alkoxy intermediate 
for further growth or the final, oxygen-free 
hydrocarbon product. With a kinetic model 
that specifically includes the effect of temper- 
ature, they demonstrate that applying tem- 
perature variations as shown by the experi- 
ments, reproduces the observed oscillations 
in rate and product selectivity. However, this 
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model does not explain why such oscillations 
develop spontaneously, which requires fur- 
ther study. 

Devising a kinetic model that reproduces 
the spontaneous oscillations observed by 
Zhang et al.—that is, without external pertur- 
bations such as temperature or pressure—is 
a challenge for which the literature offers in- 
spiration (7, 8). For example, in addition to 
surface concentrations and rates of elemen- 
tary reactions, factors such as exothermicity 
(released heat) and heat flow in the reactor 
need to be considered because these ulti- 
mately cause the oscillations in temperature 
and reactivity. Highlighting the sensitivity 
to heat transport, Zhang et al. report that 
oscillations disappear when they switch the 
diluting medium—present for analytical 
purposes—from argon to helium, which in- 
creases heat conductivity. 

The role of cerium oxide, a material 
that binds and stores oxygen in automo- 
tive exhaust systems (2), is still unclear. In 
a scenario in which O removal is the rate- 
determining step, it could be envisaged that 
cerium suboxide [which contains cerium(IID 
(Ce**)] attracts O atoms, accelerating the C-O 
dissociation and thus all subsequent path- 
ways into hydrocarbons. Once the absorption 
capacity of cerium oxide is saturated, the re- 
action slows down, making the environment 
more reducing and causing reduction of Ce* 
back to Ce**, which then resumes its acceler- 
ating role until it is saturated again. Cleverly 
designed experiments to monitor the cerium 
oxidation state in situ may test this hypoth- 
esis. Likewise, experiments aimed at deter- 
mining surface coverage of reacting species 
during the oscillations could also provide in- 
formation on the reaction mechanism. 

The finding of Zhang et al. that a catalytic 
system as complex as FTS may spontaneously 
develop oscillations that extend over an en- 
tire reactor is a notable result. It may open 
new avenues for exploratory research of oscil- 
lations in CO conversion and how they could 
offer a way to improve the overall perfor- 
mance of the FTS process, especially when 
synthesizing green fuels. & 
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A site in White Sands National Park, New Mexico, USA, contains clearly identifiable ancient human footprints. 


ANTHROPOLOGY 


Dating the arrival of humans 


in the Americas 


A debate about the age of ancient footprints continues 


By Bente Philippsen 


ating the oldest evidence for the pres- 

ence of Homo sapiens in the Ameri- 

cas is a matter of ongoing debate. 

One view is that the earliest such evi- 

dence is from 16,000 to 14,000 years 

ago, after the Last Glacial Maximum 
(LGM), when people would have crossed the 
Beringian strait from Siberia over a dry land 
bridge (1). However, controversial radiocar- 
bon dating of seeds from a set of preserved 
footprints in the White Sands National Park 
in New Mexico, USA, indicates that humans 
were present in the Americas as early as 
21,000 to 23,000 years ago—during the LGM 
(2). This site has already inspired a vivid dis- 
cussion about dates and human migrations 
(2-6). On page 73 of this issue, Pigati et al. 
(7) present a new approach and independent 
data from the White Sands site that may be 
the most convincing evidence for pre-LGM 
presence of H. sapiens in America. 

Evidence for the post-LGM migration of H. 
sapiens from Siberia to America is present 
at many archaeological sites and has left ge- 
netic traces in the current population of the 
Americas (1). However, this does not exclude 
the possibility of earlier migrations. Evidence 
for the peopling of other continents shows 
that H. sapiens were able to cross large dis- 
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tances over land and even the sea tens of 
thousands of years ago. The White Sands site 
contains several sediment layers with clearly 
identifiable human footprints (2). The study 
that produced the first estimate of 21,000 
to 23,000 years for the age of the footprints 
used radiocarbon dating of the seeds of the 
aquatic plant ditchgrass (Ruppia cirrhosa) 
(2), which were embedded between sedi- 
ment layers that contained the footprints. 
If this estimate is correct, the site is much 
earlier than other well-dated sites with clear 
evidence of human occupation and could 
be a convincing sign of human presence in 
America during the LGM (J). 

Discussions about the White Sands site 
have mainly focused on the challenges in- 
volved in dating it. Radiocarbon dating 
is a well-established and widely applied 
method of age determination in archae- 
ology. However, there are some potential 
sources of error that have to be taken into 
account. One such source is the freshwa- 
ter reservoir effect (3-6, 8, 9). This occurs 
when submerged plants photosynthesize 
dissolved inorganic carbon from the water. 
The dissolved inorganic carbon can origi- 
nate from atmospheric carbon dioxide and 
thus have the same age as that of contempo- 
rary terrestrial vegetation. It can, however, 
also originate from ancient carbon sources: 
groundwater that had been without contact 
to the atmosphere for millennia, dissolved 
carbonate minerals, or mineralized organic 
matter from peat bogs. Therefore, aquatic 
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plants such as R. cirrhosa can incorporate 
ancient carbon and thus appear older than 
they are when they are radiocarbon dated. 

The difference between the plant’s ra- 
diocarbon age and the radiocarbon age of 
contemporary terrestrial plants is called the 
reservoir age. In most cases, the dissolved 
inorganic carbon in water bodies has a mul- 
titude of sources that can lead to potentially 
very high and highly variable reservoir ages. 
The concentrations of the “C isotope in the 
dated samples should always be included 
because they can help to identify the carbon 
source(s) and would have been a valuable ad- 
dition to the debate about 
the age of the footprints (2). 
Another potential source 
of error in (2) is the rede- 
position of the R. cirrhosa 
seeds. They could have been 
preserved in much earlier 
sediments and redeposited 
millennia later to form the 
seed layers in which the 
footprints were set. 

To avoid the freshwater 
reservoir effect, Pigati et 
al. performed radiocarbon 
dating of nonaquatic pollen 
grains from the White Sands site. With the 
aim of avoiding error introduced through 
redeposition, the authors also used optically 
stimulated luminescence (OSL) dating of 
the sediment itself [a solution to the debate 
that was suggested in (4)], which determines 
when mineral grains such as quartz were last 
exposed to light. The approach can therefore 
date when sediments were redeposited by 
wind and similar processes. Both methods 
confirm the age of the White Sands foot- 
prints as 21,000 to 23,000 years ago. 

The approaches used by Pigati et al. are 
not without limitations. Radiocarbon dat- 
ing of pollen grains is an intricate process 
that comes with a risk of contamination. 
OSL dates can have large measurement un- 
certainties, and factors such as background 
radioactivity and water saturation have 
to be known to calculate accurate ages. 
Furthermore, the quartz grains could have 
been shaded by other material such as gyp- 
sum, which is prevalent in the sediments of 
the White Sands site, and thus not be fully 
exposed to light during redeposition (“in- 
complete bleaching”). However, it is highly 
unlikely that all the error sources present 
across the Pigati et al. study and in (2) intro- 
duced the same magnitude of error. Thus, 
even allowing for substantial errors in ra- 
diocarbon dating and OSL, the data overall 
from the Pigati et al. study strongly indicate 
human presence in the Americas around 
the LGM. 

The controversy surrounding the dating 
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of the White Sands footprints provides some 
valuable insights for dating this and other 
potentially very old sites. In particular, the 
findings of Pigati et al. emphasize that the 
context of a sample is one of the most im- 
portant pieces of information in radiocarbon 
dating. To optimize dating accuracy, the car- 
bon source analyzed in a radiocarbon study 
should be as physically and temporally close 
to the evidence of human action as possible. 
In terms of dating precision, much can be 
gained from including stratigraphy, eleva- 
tion, or dates from other methods in the ra- 
diocarbon age model. Such a combination 
of methods and materials 
can target several differ- 
ent mechanisms of carbon 
uptake and sample deposi- 
tion. Although there is no 
guarantee for accurate dates 
even when using all of these 
strategies, it is highly un- 
likely that different methods 
applied to different materi- 
als will always result in sys- 
tematic errors of the same 
direction and same order of 
magnitude. 

Absolute dates for archao- 
logical sites are necessary to connect human 
migrations and cultural development with 
climate and environment. Future projects 
could include following the White Sands 
footprints to a potential campsite, which 
might yield further evidence of human oc- 
cupation such as stone tools [suggested by 
(3)]. Furthermore, targeted sampling of ad- 
ditional materials for radiocarbon and other 
dating techniques, and DNA analysis of 
ancient bones and sediments on the White 
Sands site and other potentially early sites, 
could provide useful information. Results 
and materials from rescue and contract ar- 
chaeology (such as archaeological investiga- 
tion before road construction) should also 
be reevaluated and dated with the same 
scrutiny as those from research excavations. 
Last, the conclusions from the conversation 
around the dating of the White Sands foot- 
prints can help to inform future require- 
ments for rescue and contract archaeology, 
such as deeper digs to access earlier layers, 
or the collection of dating samples for future 
research projects. 
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Damaging 
diamond with 
shock waves 


Deformations inflicted in 
diamond could help 
design stronger materials 


By Marcus D. Knudson and 
Christopher T. Seagle 


he most extreme thermodynamic con- 
ditions that can be produced in the 
laboratory are created by using shock 
waves. Shock waves are the waves gen- 
erated in a material after the inflic- 
tion of a mechanical impact or rapid 
deposition of energy, such as from a high- 
intensity laser. When a strong shock propa- 
gates within a single crystal of a material, 
it deforms the crystalline lattice. This de- 
formation is accommodated by the creation 
and propagation of dislocations—crystal de- 
fects that contain an irregular arrangement 
of atoms within the lattice. On page 69 of 
this issue, Katagiri et al. (1) report imaging 
experiments on shocked diamond single 
crystals. Using an x-ray free-electron laser 
(XFEL) as an ultrafast probe, they observed 
shock-induced dislocation motion that was 
faster than the speed of sound in diamond, 
called transonic dislocations. This experi- 
mental evidence is an important benchmark 
for the theory of dislocation dynamics and 
could aid designing stronger materials. 
Deformation induced by a shockwave is 
macroscopically restricted to the direction 
of shock propagation. At low stresses, below 
the material’s elastic limit, this deforma- 
tion is reversible and is also microscopically 
uniaxial, meaning, with no deformation in 
lateral directions. Shock amplitudes above 
the elastic limit generate sufficiently high 
stress that the material can no longer re- 
sist lateral deformation and undergoes ir- 
reversible plastic deformation. Details of 
dislocation dynamics—that is, the genera- 
tion, interactions, and movement of crystal 
defects—provide information on the ability 
of a material to resist deformation (the ma- 
terial’s strength) and its damage and fail- 
ure response (how a material breaks) (2). 
Theory and simulation both provide insight 
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into dislocation dynamics. Recent 
advances in high-performance 
computing have enabled simula- 
tions to investigate dislocation 
dynamics, including direct shock 
wave propagation (3). However, 
there has been a dearth of experi- 
mental studies testing the predic- 
tions of theory and simulation, 
particularly in real time, as shock 
waves propagate in a crystalline 
(or solid) material. 

Deformation processes in crys- 
talline materials that occur dur- 
ing dynamic compression are tra- 
ditionally studied by using wave 
profile measurements or recovery 
experiments (4). Wave profile mea- 
surements use Doppler-frequency- 
shifted laser light reflected from a 
sample surface to infer its surface 
velocity—similarly to a radar gun 
measuring the velocity of a car on 
the highway. Wave profile measure- 
ments provide an integrated, mac- 
roscopically and spatially averaged 
response and are typically not sen- 
sitive to microscopic phenomena 
(including dislocation formation 
and propagation) associated with 
deformation mechanisms in solids. 
Recovery experiments, which allow 
the “interrogation” of shocked sam- 
ples after the dynamic event using 
numerous microscopic techniques, 
can provide insight into deforma- 
tion mechanisms. However, it is dif- 
ficult to separate the dislocation dynamics 
occurring during the submicrosecond shock 
event with subsequent deformation on the 
seconds timescale, which is associated with 
shock recovery. 

Recent advances in the coupling of dy- 
namic compression drivers (controlled 
shock wave generators such as gas guns and 
large lasers) to high-brightness x-ray light 
sources have opened new windows into de- 
formation studies during high strain-rate 
compression, allowing imaging of materi- 
als in time-resolved experiments. Over the 
past several years, dynamic compression 
capabilities have been added at synchro- 
tron light sources such as the Advanced 
Photon Source (APS) (5) and the European 
Synchrotron Radiation Facility (ESRF) (6); 
and XFEL sources such as the Spring-8 
Angstrom Compact Free Electron Laser 
(SACLA) (7) and the Linac Coherent Light 
Source (LCLS) (8). The extreme bright- 
ness and coherence of the light from these 
sources provide copious photons in sub- 
nanosecond bursts or pulses that are re- 
quired to penetrate micrometer- to millime- 
ter-scale samples during dynamic events. 
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X-ray radiography images show a shockwave propagating in a diamond 
crystal at early (8 ns, top) and late (500 ns, bottom) times. 


The work by Katagiri et al. is an example 
of the types of experiments that have been 
enabled through these recent advances. 
The authors used an intense nanosecond- 
scale, visible-light laser to drive ~100-GPa 
shocks into single-crystal diamond sam- 
ples, synchronizing the visible laser and 
the XFEL such that the shock occurred a 
few to tens of nanoseconds before the x-ray 
(XFEL) burst. Radiography measurements 
revealed substantial photon scattering off 
stacking faults (a type of planar defect) 
along specific crystallographic planes 
within the plastically deformed region of 
the sample. This enhanced scattering al- 
lowed them to “see” these planar defects 
within the crystal. Once initiated, these 
defects propagated, closely tracking the 
leading shock front. Combining multiple 
image frames enabled them to measure 
the dislocation motion (see the images). 
The inferred velocities reached the speed 
of sound and thus provide experimental 
evidence of transonic dislocation propa- 
gation. This is an important benchmark 
because it allows better calibration with 
theory and simulation results. 


There are several reasons why 
Katagiri et al. focused their study on 
diamond. Diamond is a prototypical 
brittle ceramic and has long been of 
scientific and technological interest 
because of its exceptional mechani- 
cal, optical, and thermal properties. 
Critically for the experiments of 
Katagiri et al., however, diamond 
also exhibits high transmission of 
low energy, so-called soft x-rays 
(with energies of less than ~10 keV). 
Although XFELs produce extremely 
high x-ray flux, the usable energy 
range is currently limited to a few 
to 10 keV. This restricts the types of 
materials that can be studied with 
these techniques. 

This restriction should be lifted 
in the coming years. Major invest- 
ments in high-brightness light 
sources, such as the LCLS (8) and 
APS upgrades (9), promise higher 
x-ray flux, and more notably, high 
flux at high energy, so-called hard 
x-rays (10 to 40 keV). This will en- 
able similar types of investigations 
on higher-density, polycrystalline 
materials, in which dislocation dy- 
namics on grain boundaries (that 
occur between micro- or nanocrys- 
tals) play a major role in determin- 
ing material strength and resistance 
to damage (JO). Enhanced scien- 
tific understanding of deforma- 
tion mechanisms in crystals gained 
through these studies may aid in 
the design of engineered materials that are 
more resistant to deformation and failure. 
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Y chromosome proteins in female tissues 


Unreliable protein-based tools are impeding sex-based research 


By Bradley D. Gelfand and 
Jayakrishna Ambati 


he role of sex in health and disease is 

of central concern to biomedical re- 

search, as reflected by policies from 

the US National Institutes of Health, 

European Commission, and Canadian 

Institutes of Health Research that 
require investigators to consider sex as a 
biological variable (7). Mechanistic under- 
standing of the role of sex in human health 
is essential to develop better and more- 
personalized care (2, 3). Because the inclu- 
sion of both sexes in basic, translational, 
and clinical research is now the rule rather 
than the exception, it is necessary to ex- 
amine the mechanisms responsible when 
sex differences are inevitably observed. 
Protein-based resources, which make up 
an essential mechanistic research toolset, 
have not been developed for the specific 
challenges presented by sex chromosome- 
encoded proteins. Consequently, an alarm- 
ing number of protein-based resources are 
erroneous, inaccurate, or misleading by, 
for example, reporting Y chromosome-en- 
coded protein expression in cells and tis- 
sues that lack a Y chromosome. 

The contemporary theory of mamma- 
lian sexual differentiation (4) establishes 
a framework for identifying mechanistic 
causes of sexual dimorphism (phenotypes 
that occur more often in a particular sex). 
This theory states that sexually dimorphic 
phenotypes can largely be attributed to 
the three basic categories of sex-biasing 
factors: permanent or organizational ef- 
fects of gonadal hormones, which typically 
happen around the time of birth or before 
(e.g., testosterone causes formation of a pe- 
nis and scrotum in fetuses); reversible or 
activational effects of gonadal hormones in 
adulthood, or at the time of the experiment 
(e.g., the transient anti-inflammatory ac- 
tivities of estrogens); and sex chromosome 
effects owing to the unequal cell-autono- 
mous activities of X and Y gene expression 
in, for example, XX and XY cells. 

The influence of sex hormones on bio- 
logical processes cannot be overstated, 
but strong evidence points to sex chro- 
mosomes playing crucial roles in human 
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health beyond reproductive organization 
and function. For example, Jacobs syn- 
drome, caused by an extra copy of chromo- 
some Y (XYY), is associated with increased 
prevalence of asthma, autism spectrum 
disorder, and seizures (5). The extent to 
which disorders of the complement of sex 
chromosomes exhibit presentations be- 
yond reproductive function indicates that 
sex chromosomes can contribute much 
more broadly to human health. Recent 
work on the somatic mosaic loss of the Y 
chromosome further indicates key roles for 
Y chromosome-encoded genes in a variety 
of complex diseases and in human lon- 
gevity (6, 7), including in cardiac fibrosis 
and mortality (8). Experimentally elegant 
genetic tools have revealed key roles for 
sex chromosomes in a host of biologic and 
pathologic processes, including behavior, 
cardiovascular diseases, immunity, me- 
tabolism, and Alzheimer’s disease (9). In 
addition to chromosome-wide disorders, 
widespread transcription of individual Y 
chromosome-encoded genes outside of re- 
productive organs has been reported, and 
evidence indicates that individual gene 
dosage differences may contribute to un- 
derlying sex differences (10). 

These observations highlight that sex 
chromosome-encoded genes may contrib- 
ute to well-documented dimorphism in 
the incidence, severity, and even treatment 
efficacies for human diseases. Identifying 
the specific gene or genes on sex chromo- 
somes that are responsible may unlock new 
therapeutic paradigms to restore, enable, 
or silence sex-specific gene function. For 
example, it may be possible to reduce the 
incidence of disease in females to match 
that of males by introducing a cohort of Y 
chromosome-encoded genes into affected 
cells and tissues. Unlocking this potential 
requires a detailed mechanistic under- 
standing of sex chromosomes in human 
health through molecular characterization 
of gene expression and protein function. 

Investigating gene function for sex chro- 
mosome-encoded genes presents distinct 
challenges. A cohort of sex chromosome 
genes includes ancestrally related X and 
Y chromosome gene pairs, or gametologs, 
which reside on the X and Y chromosomes. 
These gene pairs encode proteins with 
amino acid sequence homologies that can 
exceed 90% (JO), indicating that, at least in 


part, they have overlapping functions (see 
the figure). However, emerging evidence 
suggests that these X-Y gene pairs can ex- 
hibit distinct activities that may contrib- 
ute to sexually dimorphic phenotypes. For 
example, despite 92% amino acid identity 
with its X chromosome gametolog, DEAD 
box helicase 3 X-linked (DDX3X), the Y 
chromosome-encoded DDX3Y exhibits 
weaker adenosine triphosphatase (ATPase) 
activity, greater propensity for liquid- 
liquid phase separation, and altered RNA 
metabolism (17). The extent to which dif- 
ferences in DDX3 or other highly related 
gametologs contribute to sexually dimor- 
phic phenotypes is not yet known. 

Many genomic and transcriptomic re- 
sources accurately distinguish human 
gametologs. For example, in the Geno- 
type-Tissue Expression Project, a compre- 
hensive atlas of human mRNA expression, 
tissues from female donors show almost no 
transcripts mapping to Y chromosome-en- 
coded genes (www.gtexportal.org). None- 
theless, it is important to be vigilant that 
mapping of transcripts to sex chromo- 
somes is accurate and does not result in 
over- or underestimates of the expression 
of either gametolog. 

Unfortunately, many protein-based re- 
sources are unable to distinguish between 
X and Y chromosome-encoded gameto- 
logs. A survey of commercial antibodies 
purporting to recognize Y chromosome- 
encoded proteins identified more than 60 
such antibodies with supplier-provided 
marketing materials that report positive 
immunoreactivity in female cells and tis- 
sues (72). It is also troubling that the Hu- 
man Protein Atlas, an open-access pro- 
gram to map all human proteins in tissues, 
reports that female tissues display expres- 
sion of Y chromosome-encoded genes, in- 
cluding, for example, gliomas from females 
with positive immunostaining of neuro- 
ligin 4 Y-linked (NLGN4Y); breast can- 
cers from females positive for zinc finger 
protein Y-linked (ZFY); and ubiquitously 
transcribed tetratricopeptide repeat con- 
taining Y-linked (UTY) expression in the 
neurons of normal female brains (13). 

Some female tissues may have bona fide 
Y chromosome-derived antigens in cases 
of microchimerism, in which an individual 
harbors a small number of cells from genet- 
ically distinct individuals by way of a preg- 
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nancy with an XY fetus, a dizygotic twin, 
organ transplant, or blood transfusion (7/4). 
However, according to the survey (72), most 
of these primary antibodies are validated 
in female monoclonal cell lines, and data 
from the Human Protein Atlas demonstrate 
Y chromosome protein immunoreactivity in 
neurons of normal brain tissue (unlikely to 
have been the subject of transplantation). 
The most parsimonious explanation is that 
these represent false positive signals arising 
from poorly characterized antibodies. 

This problem also extends beyond immu- 


and label sex-specific peptides and proteins 
in publicly available antibody labeling and 
mass spectrometry data. This improvement 
can be implemented without developing 
new technologies but by better annotating 
existing protein expression databases and 
antibody datasheets for X and Y chromo- 
some-encoded genes. Antibodies target- 
ing gametologs should be assumed to be 
incapable of distinguishing between X and 
Y chromosome gene pairs unless explicitly 
validated for such applications. Antibody 
suppliers should relay this information, and 


The role of sex chromosomes in disease 

Although gametologs encoded on the X and Y chromosomes are highly homologous, they may have distinct 
functions [such as DEAD box helicase 3 X-linked (DDX3X) and DEAD box helicase 3 Y-linked (DDX3Y)]. 
Typical protein-based assays are unable to distinguish between gametologs, thwarting research of sexual 
dimorphism in disease. The copy number of gametologs in XX, XY, XO (Turner syndrome), mosaic loss 

of Y chromosome (mLOY), mosaic loss of X chromosome (mLOX), XYY (Jacobs syndrome), and XXY 
(Klinefelter syndrome) may contribute to a variety of phenotypes or diseases whose prevalence, severity, 


symptomatology, or risk factors are sexually dimorphic. 


X-Y ancestral gene pairs (gametologs) 


xX Y 
DDX3X DDX3Y 
Protein-based analyses 


1 


XX 


noassays because there are many publicly 
deposited protein sequencing studies with 
peptides mapped to Y chromosome-en- 
coded genes in female cells and tissues. For 
example, proteomic analysis of the Cancer 
Cell Line Encyclopedia reports expression of 
Y chromosome-encoded ribosomal protein 
S4 Y-linked 1 (RPS4Y), lysine demethylase 
5D (KDM5D), DDX3Y, UTY, and ubiquitin- 
specific peptidase 9 Y-linked (USP9Y) in fe- 
male cancer cell lines (/5). 

Reliable and accurate protein resources 
are essential for understanding expression 
and function of all genes, and especially sex 
chromosome-specific genes. These prob- 
lems could be addressed in several ways. It 
is important to more precisely categorize 
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researchers should state this explicitly when 
discussing results. With respect to other re- 
sources of protein expression data, caveats 
about the relative confidence and ambiguity 
of peptide mapping between X-Y gene pairs 
should be made explicit. This will improve 
the accuracy and transparency of existing 
materials, methods, and _ interpretations 
thereof. 

In addition to research specifically tar- 
geted toward understanding the biology of 
sex chromosome-based differences, gameto- 
logs should be used to stress test the limits 
of specificity and sensitivity for protein iden- 
tification and quantification more broadly. 
The existence of Y chromosome-derived 
proteins in female cells in the Cancer Cell 


Line Encyclopedia (15) implies that other 
protein pairs or families with similarly high 
homology are probably being misidentified. 
Although such misidentifications would be 
more pernicious to detect, the proportion 
of misidentified gametologs could serve as 
a proxy for the rate at which other high-ho- 
mology pairs are being misidentified. Using 
this benchmark false positive rate could be 
harnessed to classify the likelihood of mis- 
matched protein identification for every 
target in every sample type. The acknowl- 
edgment of such specificity limitations by 
producers and consumers is key to the accu- 
rate and transparent use of these resources. 

Validated resources should be created 
that can precisely distinguish between sex 
chromosome-specific proteins, including 
antibodies and protein mass spectrometry 
protein identification and quantification al- 
gorithms. This may require the development 
and implementation of new approaches for 
validating antibody and mass spectrometry 
throughputs. For example, antibody pro- 
duction should design antigenic peptides 
toward regions of sequence dissimilarity 
among gametologs, cross-adsorb antisera 
against the opposing gene pair member, 
and, in the case of Y chromosome-encoded 
antigens, validate negative antibody label- 
ing in materials without Y chromosomes. 
Improving the quality and transparency of 
protein-based molecular resources will re- 
quire the collaborative efforts of producers 
and consumers. Such efforts will greatly 
enhance the ability to identify molecular 
mechanisms of sexual dimorphism in hu- 
man health and disease. 
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CARBON MARKETS 


Confronting deep uncertainty 
in the forest carbon industry 


Resilience-based and service-focused approaches 
could reduce contentions and injustices 


By Geoff Wells'??, Unai Pascual**, Chris 
Stephenson‘, Casey M. Ryan* 


lobal momentum on carbon markets 

has the potential to direct substan- 

tial capital toward protecting the 

world’s forests. Yet the billion-dollar 

forest-carbon-offsetting industry is 

attracting criticism, in part from 
doubts about the methods used to mea- 
sure and causally attribute changes in tree 
cover and biomass (7). Many actors in the 
industry are thus pursuing increasingly 
detailed measurement and monitoring of 
carbon outcomes and risks, under the as- 
sumption that this will improve accuracy 
and offset integrity (2). However, mounting 
scientific evidence (3, 4) implies that many 
forest landscapes are subject to “deep un- 
certainty” (5), such that claims of high ac- 
curacy in assessing carbon change are likely 
to remain inherently contestable, regard- 
less of the technology or methodology de- 
ployed. Further, demands for such accuracy 
are likely to perpetuate inefficiencies and 
injustices among carbon suppliers (6-9). 
Approaches from other sectors may offer 
alternative ways forward in the absence of 
highly accurate measurements of outcomes. 


DEEP UNCERTAINTY 

Markets thrive on good information. Buyers 
and regulators in the forest carbon market 
thus demand this from carbon suppliers, 
certification standards, monitoring ser- 
vices, and rating agencies. In turn, many of 
these “supply-side” actors are incentivized 
to differentiate their offerings through com- 
peting claims of superiority in certainty, 
transparency, and even “truth” (0). Supply- 
side actors survive on their ability to con- 
vincingly quantify and causally attribute 
carbon change. 
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But the science is far from settled on 
how (or even whether) it is possible to do 
this in a robust way. Commodification of 
forest carbon relies on the principles of 
additionality, leakage, and permanence. 
Additionality requires that changes in 
carbon sequestration or stocks would not 
have occurred in the counterfactual (e.g., 
business as usual) scenario (also known as 
the baseline); controlling for leakage re- 
quires that interventions do not increase 
emissions elsewhere. Permanence requires 
that carbon stocks remain sequestered on 
decadal or centennial timescales. 

These principles were developed as a po- 
litical imperative in the 1990s and 2000s to 
operationalize global agreements on carbon 
accounting. The assumption that we can 
meaningfully measure and attribute forest 
carbon change has subsequently become 
deeply engrained in market rules and in 
buyers’ expectations (6). To assess these 
principles, the market relies on estimated 
changes in tree cover and biomass as well 
as assessments of their causality and risk. 
The skills and costs of such analytics and 
monitoring remain the main barrier to en- 
try for projects (2, 11), and investments in 
new commercial analytical services reach 
into the tens of millions of dollars (10). The 
perceived accuracy of these analyses by 
a given project is also often equated with 
overall carbon offset quality and underpins 
perennial disagreements about the effec- 
tiveness of many projects (1). 

Yet despite deeply entrenched market 
expectations about the measurability of 
forest-carbon change, recent science tells us 
that these analyses—and thus the commod- 
ity of forest-carbon offsets—are likely to re- 
main uncertain. Forest landscapes are com- 
plex and dynamic systems in which tree and 
soil carbon stocks fluctuate in response to 
diverse interactions between natural (e.g., 
tree mortality, competition, pests, herbiv- 
ory, and natural fires) and anthropogenic 
drivers (e.g., land clearing, wood harvest- 
ing, and tree planting). Critically, the rela- 
tive influences of these drivers vary greatly 


even across small areas and are usually ore 
partially understood, and only partially uv-— 
servable (3). 

In ecology, this evidence has seen chal- 
lenges to theories of climax communities 
and single stable states (e.g., where undis- 
turbed vegetation is assumed to continu- 
ally increase in biomass before reaching a 
predictable and stable maximum) by more 
dynamic understandings of nonlinearity, 
hysteresis, and alternative stable states (4). 
The social context of carbon projects adds 
further uncertainty. Mounting evidence on 
the complexity of socioeconomic dimen- 
sions of land use suggests that past observa- 
tions of land-use change are not necessarily 
a good guide to the future and that complex 
social feedbacks often lead to unintended 
consequences (3). Overall, strong ecological 
and socioeconomic evidence now exists that 
forest carbon stocks fluctuate in ways that 
are hard (and sometimes impossible) to 
precisely understand, predict, and control. 

These advances sit uncomfortably with 
claims of high certitude about projects or 
policies that cause forest carbon change. 
Certainty in additionality and leakage would 
imply that it is possible to reliably measure 
biomass change and that one can robustly 
attribute causality (e.g., through objective 
counterfactual scenarios and _ baselines). 
Permanence would imply that forest bio- 
mass can be made to reach something akin 
to a climax community with stable levels of 
carbon. But a conservative interpretation of 
the present science indicates that, no mat- 
ter the technologies deployed, causal attri- 
butions of forest change and expectations 
of quasi-stable biomass will remain contest- 
able (even where claims are retrospective, 
e.g., in ex post crediting models). That is, 
many forest carbon interventions operate 
in something akin to “deep uncertainty’—a 
world of unknown unknowns (5). 

Deep-type uncertainty subsequently 
points to a more fundamental tension 
between dominant forest carbon-offset 
methodologies and advances in the under- 
standing of complexity in socioecological 
systems. Even if it were possible to robustly 
assess changes in a singular system-state 
variable (e.g., tonnes of forest carbon se- 
questered), complexity science suggests that 
doing so would (on its own) provide little 
insight about whether a system has under- 
gone a Sustained (e.g., “permanent”) regime 
change (or transformation) to a new (e.g., 
higher carbon) state (12). Such changes are 
instead more closely associated with the 
broader stability of the socioecological sys- 
tem (e.g., the enduring environmental, so- 
cial, and economic factors that encourage 
or discourage changes in tree biomass) and 
the ability of an intervention to maneuver 
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(i.e., its adaptive capacity) to retain a desired 
state in the face of (unanticipated) shocks 
and stresses (i.e., its general resilience). 

Many carbon schemes do assess “risks” 
to additionality, leakage, and permanence 
from specific environmental, social, and eco- 
nomic factors. However, by definition, these 
risk assessments can only address “known 
knowns” that can be anticipated and about 
which one can estimate probabilities and im- 
pacts (5). They therefore can say little about 
the likelihood that an intervention will pre- 
vail in the face of unanticipated (and some- 
times incomputable) shocks and stresses. 
Thus, not only are quantifications of forest 
carbon outcomes and associated (specific) 
risks likely to remain contestable, but subse- 
quent claims of permanence are also likely to 
be inherently precarious. 


COSTS AND INJUSTICES OF CLAIMS 

TO ACCURACY 

In the context of entrenched assumptions 
and market expectations that forest car- 
bon change is measurable and attributable, 
supply-side actors have tended to respond to 
these challenges by investing in ever-more- 
detailed technical measures for assessing 
carbon outcomes and risk (e.g., conservative 
estimates of carbon removals, new modeling 
and monitoring technologies, offset insurance 
products, blockchain-enabled transparency 
measures, and automated risk assessments). 
Yet evidence of deep uncertainty in forest 
landscapes suggests that skepticism about 
carbon measurement and attribution will re- 
main a stubborn burden on the sector. 

At present, this burden manifests in dif- 
ferent ways. For one, it continues to cause 
disputes about the worth of different car- 
bon schemes (7)—disputes that may simply 
reflect the preferences of different analysts 
rather than any fundamental truths about 
carbon sequestration. Such disputes not only 
threaten the reputation (and thus the sur- 
vival) of these individual projects but also 
diminish market confidence more widely. 
Persistent uncertainty drives volatility in 
prices and revenues and has led to an arms 
race in the sophistication (and often cost) of 
competing (and still contested) assessment 
methodologies (2). Volatile prices and high 
compliance costs subsequently remain the 
main barriers to entry for suppliers, which 
now renders unviable up to 80% of potential 
projects in the tropics (/7). 

Questionable assumptions and expecta- 
tions around certitude also create overbur- 
densome compliance demands, which can 
in turn fuel unfairness and inefficiency. In 
the Global South, many carbon suppliers are 
local organizations, communities, or small- 
holders from poor areas. Often these suppli- 
ers have little capacity to negotiate the rules 
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to which they are subjected and can be pres- 
sured to redirect already-scarce resources to- 
ward questionably complex and costly com- 
pliance measures (6, 7). Suppliers can also be 
held to external standards of certainty and 
performance that are virtually unachievable 
(8). Such unfair, costly, and restrictive condi- 
tions can subsequently combine to hinder ef- 
ficiency and innovation (9). Ultimately, none 
of this is good for buyers, suppliers, or the 
climate. 


IDEAS FROM OTHER SECTORS 
Despite these challenges, a degree of path 
dependency and _ institutional lock-in 


around market-based approaches to cli- 
mate change mitigation implies that such 
approaches are likely to remain economi- 
cally and politically entrenched—and are 
likely to remain a key source of capital flows 
for forest conservation. Within this context, 
we point to existing approaches from other 
sectors that could inspire improved ap- 
proaches to uncertainty in the forest car- 
bon-offset industry. 

For one, health, financial, conservation, 
and other sectors sometimes deploy “value 
of information” approaches to assess when 
generating more-detailed data and informa- 
tion is worth the cost, relative to marginal 
changes in certitude, usability, and user- 
perceived legitimacy (73). Certification orga- 
nizations, regulators, and other rule-makers 
in the forest carbon industry could explore 
similar approaches to more objectively as- 
sess what level of complexity is actually war- 
ranted in measurement and monitoring for- 
est carbon stocks. More broadly, the industry 
could explore introducing a fundamental 
principle of parsimony in methodologies, 
rules, and assessments—where approaches 
should be no more complicated than they 
need to be and should focus on the most un- 
certain components. Key to this is recogniz- 
ing that accurately measuring carbon stocks 
is only a small part of assessing a project’s 
impact and quality. 

Another opportunity stems from a poten- 
tial shift from risk- to resilience-based ap- 
proaches to assessing the quality of projects, 
which consider not only known risks but 
also the capacity to handle the unknowns. 
Energy, finance, and other sectors approach 
deep uncertainty by investing in (and as- 
sessing) their technical, organizational, so- 
cial, and economic readiness to persist in 
the face of unknown shocks and stresses— 
that is, their adaptive capacity and general 
resilience (/4). Although care would need to 
be taken to not increase (and to preferably 
reduce) compliance costs, the forest carbon 
industry could emulate these sectors by 
moderating its emphasis on the precision 
of (in any case uncertain) estimated out- 


comes and instead more closely relate offset 
quality to general resilience and adaptive 
capacity of an intervention—for example, a 
general “stress test” based on the ecologi- 
cal context; the depth and breadth of skills, 
knowledge, labor, and resources available; 
and the strength of institutional and gov- 
ernance systems, associated inclusivity, and 
learning (12, 14). Higher adaptive capac- 
ity also usually engenders higher levels of 
stakeholder consultation and participation, 
thus also enhancing fairness and legitimacy 
(9). 

Somewhat more radically, some operators 
in these other (noncarbon) sectors address 
uncertainty by adopting service-focused 
models that allow trading to proceed in the 
near-total absence of accurate measurements 
of outcomes. By using “efforts standards” 
from contract law, buyers pay suppliers to 
maximize the delivery of a service, even if 
there is uncertainty about the precise level 
of final delivery (15). For example, internet 
service providers agree to undertake specific 
measures within their control (i.e., “best ef- 
forts”) to maximize the speed and stability of 
a service up to a maximum potential level, 
even if the actual level may be only approxi- 
mately known and may fluctuate because of 
events outside of their control (e.g., internet 
traffic, weather damage to infrastructure). 

For forest carbon suppliers, systems the- 
ory provides a basis for assuming that al- 
though the precise level of a desired system 
function (e.g., forest carbon sequestration) 
may not be knowable, it can be assumed to 
be maximized over time when actors (e.g., 


science.org SCIENCE 


d 
' 


> 


forest stewards) use their adaptive capacity 
(e.g., their land tenure rights and available 
capital) to maximize the general resilience 
of a desired system state (e.g., a landscape 
with more trees) to (often unanticipated) 
shocks and stresses (e.g., political and eco- 
nomic drivers of deforestation) (72). Thus, 
from a complexity perspective, approaches 
focused on maximizing adaptive capacity 
and resilience among carbon suppliers may 
be more appropriate for managing radi- 
cal uncertainty rather than pursuing ever- 
more-detailed (but inherently uncertain) 
measurements of carbon outcomes. 

The exact modalities of how such a service- 
focused approach could apply in practice 
would need to be explored, along with how it 
links to the presently entrenched, commod- 
ity-focused (i.e., payments by tonnes of car- 
bon) regulatory and market infrastructure. 
We do not seek to offer a definitive model 
here, but where there is a buyer there is a 
way, and judging by other sectors, a service- 
focused model may be sufficient for some. 

For example, the industry could examine 
approaches whereby carbon buyers sub- 
scribe to a scheme based on an approximate, 
parsimonious, and externally verified pro- 
jection of the expected “carbon return” (i.e., 
removals) over a period. Suppliers could be 
held to a service agreement that sets out 
best-effort thresholds (i.e., activities within 
the limits of their adaptive capacity) for 
maximizing this carbon sequestration and 
its resilience. Where these best efforts are 
met, the supplier would be rewarded and 
the buyer could make climate claims based 
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on the original expected projection. And 
this would occur even if the precise level of 
sequestration remains unknown and may 
have been higher or lower as a result of ex- 
ogenous influences. Such approaches may 
encourage buyers and suppliers to redirect 
investments toward adaptive capacity and 
resilience, and supply-side actors may be 
empowered to be more transparent about 
what they can actually predict, measure, 
and control. 

Activity-based monitoring is already 
sometimes used as a precursor to calculat- 
ing carbon change in the sector, in contrast 
to approaches that attempt to directly mea- 
sure carbon change (e.g., outcome-based 
methods). Complexity science suggests that 
activity-based approaches on their own may 
be sufficient (and more appropriate) for max- 
imizing carbon sequestration under deep un- 
certainty, where further meaningful post hoc 
assessments of carbon change and attribu- 
tion are, in any case, often not possible. 

Whatever the models that might emerge, 
service-focused approaches would likely 
entail a change in the value proposition to, 
and the expectations of, buyers and regula- 
tors in voluntary and/or compliance mar- 
kets whereby a buyer’s climate claims are 
based on making something akin to “con- 
ditional investments in sustainable and 
just future landscapes” rather than only 
purchasing specific amounts of (in any case 
contestable) carbon offsets. Such a model 
might also align with the emerging “car- 
bon-contribution” approach to mitigation 
claims, whereby purchased credits cannot 


Forest land near Ashford, 

“ Washington, is part of a 
~~ Project that allows the private 
nonprofit Nisqually Land 

Trust to sell carbon credits. 


“offset” a buyers’ emissions but are instead 
indicative of their broader commitment to 
addressing climate change. Under such an 
approach, approximate quantifications and 
conditional activity-based payments may 
suffice. 

Generally, we propose that by adopting a 
complexity-informed perspective and explor- 
ing alternative approaches to confronting 
deep uncertainty, the forest carbon industry 
(and potentially other ecological markets) 
may find new pathways for addressing many 
of the contentions and injustices that pres- 
ently exist in the sector. 
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Our oceans, ourselves 


An oceanographer’s expansive portrait of the 
marine world captures its vitality and complexity 


By Todd L. Capson 


n her new book, The Blue Machine, phys- 

icist and oceanographer Helen Czerski 

describes a dynamic “ocean engine’— 

a tightly woven system that converts 

sunlight into movement and life—writ- 

ing on scales ranging from bubbles 
to the Gulf Stream and from krill to blue 
whales and highlighting the ocean’s vital 
connections to human history and culture. 
Her description of the ocean, including its 
complexity and diversity, is vast and com- 
prehensive. Yet The Blue Machine 
is a point of departure, a map for 
further exploration. Not since 
reading The Diversity of Life by 
E. O. Wilson have I read a book as 
timely, salient, and informative. nr 

Czerski explains a broad range 
of important phenomena, proper- 
ties, and concepts and provides 
examples that are engaging and 
illuminating. The following sum- 
maries illustrate her approach. 
She describes, for instance, how 
the sunlit upper layer of the 
ocean provides heat energy to the ocean and 
sculpts global weather patterns. Underneath 
are thicker layers of varying salinity and 
temperature, typical of ocean basins. She 
renders the abstract concrete, describing the 
battle between Mark Antony and Octavian in 
31 BCE in which Antony’s ships were possi- 
bly stymied by “dead water’—internal waves 
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The Blue Machine: 
How the Ocean Works 
Helen Czerski 
Norton, 2023. 464 pp. 


in denser, saltier water below that impede 
forward motion. Octavian would go on to es- 
tablish the Roman Empire. 

The Blue Machine illustrates the critical 
role of density to the physical structure of 
the ocean with a description of the Denmark 
Strait Overflow, where 3 million m® per sec- 
ond of dense, cold, salty Arctic water flows 
underneath the Atlantic. This feature is an 
important component of the thermohaline 
circulation system that transports warm 
water from the tropics to the poles. Czerski 
explains how salinity has major repercus- 
sions on the physical structure 
and machinery of the ocean and 
its inhabitants, such as the leath- 
erback turtle, which copes with a 
daily salt intake of 10 kg by crying 
8 liters of salty tears every hour. 
She describes the overall shunting 
of energy from equatorial regions 
to the poles and how polar regions 
play critical roles for the flow of 
energy from Earth into space. 
She explains what the North and 
South Poles have in common and 
how they differ. 

Czerski writes of the travails and joys 
of being an ocean scientist, including an 
expedition to study the exchange of gases 
between the ocean and atmosphere in the 
middle of a storm in the North Atlantic. 
She writes of the importance of collecting 
data in the field that will keep computer 
models from “stray[ing] into the world of 
computer games.” 

Describing her initial exposure to ocean 
science as a postdoc, after obtaining three 
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Leatherback sea turtles keep the ocean’s salinity upc 


bay by constantly expelling salty tears. 


physics degrees, Czerski reflects on her ex- 
asperation that “no one had ever mentioned 
the ocean.” Her reaction reminded me of the 
response of Mireya Moscos, then president 
of Panama, to a presentation on why the 
Coiba National Park was worthy of protec- 
tion (“Why didn’t anyone tell me?”) (1). 

Czerski writes about society’s tendency 
to see the ocean as “away,” a solution to the 
Great Stink of London in 1850, when excre- 
ment was pumped into the North Sea until 
returning tides and fouled fisheries forced a 
better solution. Nature, on the other hand, 
puts waste to good use, as demonstrated 
by the blue whales in the Southern Ocean, 
whose generous clouds of excrement bring 
nutrients into sunlit surface waters to be 
recycled by the splendidly named krill 
Euphausia superba. 

The Blue Machine provides important 
context to help readers better understand 
humanity’s impacts on the ocean. We have 
demolished the balance of the global car- 
bon budget by burning fossil fuels, result- 
ing in what Nicolas Gruber of ETH Ziirich 
calls the “triple whammy” of ocean acidi- 
fication, warming, and deoxygenation (2). 
We now observe unprecedented ocean 
surface temperatures that supercharge 
storms and alter global weather patterns, 
pervasive hypoxia on global coral reefs, 
and warnings of a forthcoming collapse of 
the Atlantic Meridional Overturning Cir- 
culation, among other challenges. The tra- 
ditional Polynesian navigation techniques 
that Czerski describes—underpinned by 
teamwork, humility, and a profound un- 
derstanding of the natural world—provide 
a model for addressing these threats, a 
stark contrast to the hubris of interven- 
tions such as geoengineering. 

My only reservation about the book is 
Czerski’s use of metaphors and other liter- 
ary embellishments that sometimes get in 
the way of her otherwise clear and descrip- 
tive prose—although she is entitled to some 
creative license for having written a tome 
that is both substantive and accessible. Ul- 
timately, The Blue Machine is a major con- 
tribution to our shared responsibility of 
educating, inspiring, and informing societal 
values about the ocean. & 
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The mRNA enthusiast’s memoirs 


A molecular biologist documents how her passion 
for the messenger molecule paid off 


By Vijaysree Venkatraman 


ne day in 1997, University of Penn- 
sylvania molecular biologist Katalin 
(Kati) Karik6 met a new hire at the 
photocopier who would change the 
trajectory of her research forever. 
At 42, the scientist had made little 
headway with her big idea that messenger 
RNA (mRNA)—the ephemeral molecule 
that instructs cells to make proteins—could 
be commandeered to help the body fight 
disease. Her grant proposals to 
study the therapeutic potential 
of mRNA continued to get re- 
jected. She had no budget, no 
staff, and very little prestige. 

Karik6 introduced herself to 
the “photocopier-hogging im- 
munologist,’ Drew Weissman, 
and asked about his work. He 
planned to create vaccines 
against infectious diseases 
and discussed ways to deliver 
an antigen into cells to trig- 
ger the body's own immune 
response. At once, Kariké saw 
another prospective role for 
her pet molecule, and a long- 
standing collaboration was 
born that laid a strong founda- 
tion for the mRNA-based vac- 
cines that proved vital during 
the COVID-19 pandemic. In 
Breaking Through,  Karik0o’s 
page-turning memoir, the tal- 
ented and tenacious researcher 
recounts the origins and evolution of her 
scientific journey. 

Karik6 grew up in a small town in 
Hungary, the daughter of a skilled butcher. 
There, a high school biology teacher intro- 
duced her to endocrinologist Hans Selye’s 
The Stress of Life—a book that would reso- 
nate with her both as an introduction to 
scientific thinking and as a guide to living 
well. Nature “rarely replies to questions un- 
less they are put to her in the form of experi- 
ments, to which she can say yes or no,” wrote 
Selye in that book in 1956. 

Selye’s observations helped Karik6 real- 
ize that she wanted to spend her life design- 
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ing experiments that would contribute to 
human knowledge. Her parents encouraged 
her to become a kutat6—Hungarian for 
“searcher’—or, in English, a researcher. 

She earned her PhD from the presti- 
gious Biological Research Centre in Szeged, 
Hungary, where she began her exploratory 
research on medicinal applications for RNA. 
When her funding was pulled, she found a 
postdoc position in Philadelphia. She moved 
to the United States with her husband and 
toddler, Susan, with the family’s meager sav- 


Kariké’s hunch that mRNA could be used to fight disease proved correct. 


ings sewn into the little girl’s teddy bear. 

“Do not blame,” wrote Selye. “Focus on 
what you can control. Transform bad stress 
into good stress.” This advice would serve 
Kariko well over the course of her nearly 
four-decade-long academic career, a period, 
she writes, in which she was belittled, ig- 
nored, demoted, and even threatened with 
deportation when her research did not win 
government grants or private funding. 

Still, nothing upset her as much as one 
early lab finding: Synthetic mRNA triggers 
an inflammatory immune response in cells. 
An mRNA vaccine might still have been pos- 
sible, Karik6 and Weissman reasoned, but 
mRNA would be useless as therapeutic medi- 
cine until they figured out a way to keep it 
from activating the immune system. 
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Breaking Through: 
My Life in Science 
Katalin Karik6é 
Crown, 2023. 
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In 2005, when the pair reported that a 
modified mRNA produced a noninflamma- 
tory immune response in cells, their land- 
mark finding (J), which would eventually 
be licensed by biotech companies, went 
unheralded. In 2013, in a rude reminder 
that lab space is often allocated on the ba- 
sis of the funding a researcher brings in, 
Karik6 was unceremoniously displaced. 
“T showed up to my lab in the neurosurgery 
department and found my belongings in 
the hallway,’ she writes. She was moved to 
a tiny room that was not con- 
ducive for working with mRNA. 
This was the final straw. 

That year, Kariké received 
job offers from two companies 
that had licensed the modified 
mRNA technology: BioNTech in 
Germany and Moderna in the 
United States. When she packed 
her bags and left temporarily for 
Germany, her daughter—by now 
a two-time Olympic gold medal- 
ist—and husband stayed behind. 

Karik6 compares her science 
with her daughter’s rowing. 
The crew rows backward, she 
observes, without a view of the 
finish line. Just as she has done 
in the lab, they must trust that if 
they give it their all, their hard 
work will bring them to the 
right destination. 

Karik6’s discovery found a 
meaningful application during 
the COVID-19 pandemic, but the 
story of synthetic mRNA technology will not 
end here. Scientists are studying its therapeu- 
tic potential for various cancers, cystic fibro- 
sis, and rare metabolic disorders, as well as 
for vaccines against other infectious diseases, 
Karik6 writes. In the next decade, she predicts, 
there will be an explosion of new mRNA thera- 
pies and vaccines. Her moving memoir offers 
a fascinating glimpse into the dead ends and 
hard work that led up to this important dis- 
covery, while highlighting critical shortcom- 
ings in the scientific reward system. & 
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Historic introductions 


We asked young scientists this question: If you could introduce any two 
scientists, regardless of where and when they lived, whom would 
you choose, and how would their collaboration change the course of 
history? Read a selection of the responses here. Follow NextGen Voices 
on social media with hashtag #NextGenSci. 


Improved medicine 


Hungarian-American biochemist Katalin 
Karik6o has contributed to the mRNA 
technology that made the COVID-19 vac- 
cine possible. French virologist Francoise 
Barré-Sinoussi first identified HIV (along 
with Luc Montagnier). Both scientists 
are alive today, but I would have liked 

to introduce them in 1983, when Barré- 
Sinoussi first made her HIV discovery. 
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The collaboration between these two great 
minds decades ago could have changed 
the course of multiple pandemics. 


Keren Bismuth 
Paris, France. Email: bismuthk2@gmail.com 


French physicist Marie Curie (born in 
Poland) lost her life in 1934 to aplastic 
anemia caused by radiation exposure. 
German immunologist Paul Ehrlich, who 
lived at the same time, first identified the 


sah ie 


cause of aplastic anemia. I would have 
liked to introduce these two scientists in 
time for Ehrlich’s knowledge to prevent 
Curie’s death. Together, they could have 
educated the world about the negative 
effects of radiation, and perhaps even 
worked on a cure for aplastic anemia. 
Vandana Sharma 

Department of Hematology, All India Institute of 


Medical Sciences, New Delhi, New Delhi, India. 
Email: sharmavandana.phd@gmail.com 


Santiago Ramon y Cajal, born in Spain 
in 1852, is said to have been so bril- 
liant that simply by looking through 
early 1900s microscopes, he was able to 
intuit the role of specific neurons. What 
if Cajal had been able to meet one of 
today’s pioneers in the intersection of 
technology and neuroscience? French 
neuroscientist Grégoire Courtine’s recent 
advancements include developing an 
integrated brain-computer interface to 
recover walking after spinal cord injury. 
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Cajal’s discoveries were limited only by 
the tools available to him. If paired with a 
researcher capable of driving technology 
forward, it’s hard to envision what prob- 
lems in neuroscience couldn’t be solved. 


Jackson Ross Powell 

Vagelos Molecular Life Sciences Program, 
University of Pennsylvania, Philadelphia, PA 19104, 
USA. Email: jrp24@sas.upenn.edu 


A collaboration between Hua Tuo, a 
second-century Chinese surgeon, and 
Leonardo da Vinci, a 15th-century Italian 
architect and anatomist, would combine 
ancient Chinese medical wisdom with 
Renaissance artistic brilliance. Hua Tuo 
could impart the essence of traditional 
Chinese medicine, including acupuncture 
and herbal remedies, to da Vinci, allowing 
him to portray the human body in a more 
holistic way. In return, Leonardo’s meticu- 
lous anatomical studies and innovative 
scientific approaches could revolutionize 
Hua Tuo’s medical practice, enhancing 
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surgical techniques and diagnostics. 
Their collaboration might have ignited a 
comprehensive healthcare approach that 
blended Eastern and Western wisdom 
and altered how the human body is 
understood, treated, and depicted. 
Haoxian Tang 


Shantou University Medical College, Shantou, 
Guangdong, China. Email: 19hxtang@stu.edu.cn 


Leonardo da Vinci, 15th-century Italian 
painter, engineer, architect, and inven- 
tor, and Richard Feynman, 20th-century 
US theoretical physicist, could have 
worked together to revolutionize medi- 
cal technology. Da Vinci’s inventions 
include “Leonardo’s robot,” a functional 
automaton, which may have served as 
the inspiration for today’s widely used 
robotic surgical systems. If da Vinci 

had met Richard Feynman, the father 

of nanotechnology, perhaps they could 
have created “Leonardo’s nano-robot,’ a 
micromachine used to remove bacteria 
and viruses, repair blood vessels, and kill 
tumor cells accurately in vivo. 

Bo Cao 

Core Research Laboratory, The Second Affiliated 
Hospital, School of Medicine, Xi'an Jiaotong 


University, Xi'an, Shaanxi, China. 
Email: bocao@vip.qq.com 


German microbiologist Robert Koch, 
born in 1843, discovered the microbes 
that cause anthrax and tuberculosis and 
pioneered bacteriology. Scottish doctor 
Alexander Fleming, born in 1881, dis- 
covered penicillin, the first all-purpose 
antibiotic. If these two researchers had 
worked together, they could have made 
great strides in topics such as the traits 
of pathogens, the mechanics of transmis- 
sion, and antibiotic responses, leading 
to a better understanding of infectious 
diseases and more effective antibiotics. 
Jingtao Huang 


Shantou University Medical College, Guangdong, 
China. Email: 19jthuang@stu.edu.cn 


Robert Hooke, a 17th-century 

English physicist, developed the first 
microscope. This tool would have 
enabled ancient Greek anatomist 
Herophilus, who lived in the 3rd 
century BCE, to extend his studies 
beyond macroscopic explorations of 
the human body. Having described 

the retina and optic nerve, Hooke’s 
microscope would have enabled 
Herophilus to discover, for instance, 
the many cell types composing these 
structures. Herophilus’s embrace of the 
scientific method and experimentation 
would have led to huge strides in 


demystifying the human body and its 
functionality. Instead of being chastised 
at the time as a useless endeavor, 

the study of anatomy would have 
flourished, fundamentally transforming 
the course of medicine and human 
health. 

Rishi Jai Patel 

Department of Chemistry, University of 


Pennsylvania, Philadelphia, PA, USA. 
Email: ripatel@sas.upenn.edu 


Advanced technology 


Leonardo da Vinci, the 15th-century 
Italian polymath known for his engi- 
neering, architectural, and artistic 
prowess, and Nikola Tesla, a Serbian 
electrical and mechanical engineer 
born in 1856 in the part of the Austrian 
Empire that is now Croatia, possessed 
complementary knowledge, skills, 

and insights. Tesla’s understanding of 
electricity might have been the key to 
making da Vinci’s sophisticated designs 
feasible. Conversely, da Vinci’s wide 
variety of creative designs could have 
inspired Tesla to apply his electricity 
expertise to other fields. If the two of 
them had collaborated, helicopters and 
submarines might have been invented 
earlier, altering the course of history. 
Paulo Bezerra 

Department of Engineering and Technology, 
Universidade Federal Rural do Semi-Arido, Pau dos 
Ferros, RN, Brazil. 

Email: paulo.bezerra@ufersa.edu.br 


Ada Lovelace, born in England in 1815, 
was renowned for her contributions to 
early computer science. Her legacy as the 
world’s first computer programmer is 
primarily based on her groundbreaking 
work with Charles Babbage’s analytical 
engine—an innovative early mechani- 

cal general-purpose computer. Alan 
Turing, born in England in the 20th 
century, specialized in computer science, 
mathematics, and cryptography. He is 
celebrated for contributions to early 
computing, artificial intelligence, and 
cryptography during World War II. Their 
collaboration might have accelerated 

the development of early computers and 
algorithms, potentially giving rise to arti- 
ficial intelligence much earlier in history. 
Programmable computers in the 19th 
century could have reshaped communica- 
tion, cryptography, and automation as 
well as influenced historical events such 
as wars. 

Xuan Zhang 
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Geoffrey Hinton, a modern-day 
British-Canadian cognitive scientist 

and computer scientist, is a pioneer 

in artificial intelligence (AI). The use 

of AI is growing, but because of the 
complex mathematical relationships on 
which it depends, its internal workings 
remain largely a black box. Srinivasa 
Ramanujan, an Indian mathematician 
who lived at the turn of the 20th century, 
possessed an uncanny ability to invent 
mathematical tools to model difficult 
real-world problems. If they collabo- 
rated, Ramanujan’s skill could illuminate 
the tangled associations inside AI neural 
networks. Meanwhile, Hinton’s extensive 
experience in building AI algorithms 
and systems would be vital to implement 
Ramanujan’s mathematical theories as 
enhanced AI. Together, Ramanujan’s 
unique brilliance in crafting abstract 
mathematics and Hinton’s technical cre- 
ativity could unravel the cryptic core of 
AI. Their collaboration might produce AI 
that is more interpretable, controllable, 
trustworthy, and aligned with human 
values and ethics. 

Qianjun Wen 

The Affiliated Hospital of Guizhou Medical 


University, Guiyang City, Guizhou Province, 
China. Email: wqjtmmu@126.com 


Greater equity 


I would introduce 19th century English 
naturalist Charles Darwin and modern- 
day Brazilian psychologist Deisy das 
Gracas de Souza, who has explored how 
behavior patterns affect reading and 
writing skills. Both Darwin’s theory of 
evolution and the study of behavior 
evolution emphasize the role of the 
environment—Darwin focused on spe- 
cies’ adaptations over generations and 
de Souza studies how an individual’s 
behavior changes as a result of envi- 
ronmental consequences. Marrying 
Darwin’s evolutionary principles with de 
Souza’s expertise in learning might have 
revolutionized our understanding of how 
to arrange the environment to promote 
learning and positive change at both 
individual and cultural levels. 

Fernanda S. Oda 

Department of Applied Behavioral Science, 


University of Kansas, Lawrence, KS, USA. 
Email: oda@ku.edu 


Introducing 20th-century US virologist 
Jonas Salk, the ingenious mind 

behind the polio vaccine, and recently 
deceased physician Paul Farmer, the 
esteemed medical anthropologist 
dedicated to addressing healthcare 
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disparities, would produce a promising 
collaboration. Salk’s selfless decision not 
to patent his vaccine perfectly aligns 
with Farmer’s unwavering commitment 
to equitable healthcare access. By 
combining Salk’s breakthroughs with 
Farmer’s sociomedical perspective, they 
would revolutionize our approach to 
tackling epidemics and ensure global 
health for all. Salk would learn about 
the complex social determinants 
affecting healthcare access from Farmer, 
and Farmer could gain insights into 
cutting-edge scientific advancements 
from Salk. Their united vision would 
inspire a paradigm shift, emphasizing 
humanitarianism over profit, ensuring 
that lifesaving treatments and preventive 
measures are accessible to everyone, 
regardless of their background. 

Olesia Verstiuk 


University of Nicosia Medical School, Nicosia, 
Cyprus. Email: verstiuk.o@live.unic.ac.cy 


Abu Ali al Hasan ibn al-Haytham (known 
in the West as Alhazen), was a 10th- 
century mathematician and optician who 
lived in what is now Iraq. Introducing 
him to 17th-century English physicist 

Sir Isaac Newton would facilitate an 
unparalleled exchange of knowledge and 
ideas from two very different parts of the 
world, leading to revolutionary discover- 
ies in the field of light and optics. More 
importantly, this meeting would promote 
cultural exchange between some of the 
brightest minds of the Islamic world 

and the West, raising the profile of the 
contributions of Muslim scientists to the 
world’s intellectual heritage. The rich 
legacy of the Islamic Golden Age (often 
referred to as the medieval or Dark Ages 
of the West) is frequently overshadowed 
in Western historical narratives, so this 
encounter could pave the way for greater 
cross-cultural scientific exchange and 
mutual intellectual appreciation. 
Muhammad Shehryar Khan 

Department of Materials Science and Engineering, 


Massachusetts Institute of Technology, 
Cambridge, MA, USA. Email: mskhan@mit.edu 


What would have happened if the first 
public warning that climate change was 
an existential challenge for humanity had 
come before Edward Teller’s remarks in 
1959? A collaboration between Swedish 
physical chemist Svante Arrhenius, born 
in 1859, and modern-day US meteorologist 
Syukuro Manabe could have resulted in an 
earlier physical model of Earth’s climate 
system that allowed them to quantify the 
impact of human-induced climate change. 
A warning of the climate crisis at the start 


of the 20th century might have given 
humanity invaluable time to transition 
from fossil fuel-based energy systems to 
net-zero emission ones. This scientific 
collaboration might have reduced the 
likelihood of experiencing catastrophic 
consequences of climate change. 

Edgar Virgiiez 

Department of Global Ecology, Carnegie 


Institution for Science, Stanford, CA, USA. 
Email: evirguez@carnegiescience.edu 


David Harold Blackwell, US statistician 
and mathematician born in 1919, and 
George Washington Carver, US agricultural 
scientist born around 1864, could have 
collaborated to increase food security. 
Blackwell’s mathematical theory would 
help Carver study the genetic materials 
and characteristics of crops more accu- 
rately, so as to breed crops that meet 
people’s needs and improve farming meth- 
ods. Carver’s agricultural production data 
and experience could provide valuable 
practical materials for Blackwell’s research 
on statistics and decision-making. 

Yuan Zhi 


School of Economics, Guizhou University, Guiyang, 
Guizhou, China. Email: yzhi@gzu.edu.cn 


When 18th-century English immu- 
nologist Edward Jenner arrives—1600 
years before his time—at the Asclepion 
(healing temple) of the ancient Greek 
city of Pergamon, Galen takes no notice. 
Preoccupied with his theory of the four 
humors and dissections of Barbary 
macaques, Galen is uninterested in 
milkmaids’ lesions. But in 166 AD, Jenner 
uses cowpox to inoculate a young boy 

to the Antonine Plague (known today 

as smallpox). Galen, great student and 
synthesizer, engages Jenner, soon record- 
ing Jenner’s method in one of his medical 
treatises. Galen’s article is transcribed 
and traded, and smallpox is eradicated by 
300 AD (17 centuries before 1980). Other 
vaccinations follow. When Christopher 
Columbus arrives in the “New World,” 

his men and the Indigenous peoples 
exchange minor infections, but there is 
no epidemic. Over the next few centuries, 
as settlers push westward and wars break 
out, many Native American tribes, at full 
strength given the lack of new diseases, 
are able to repel the invaders. The United 
States, as it looks today, is never estab- 
lished. This journal is never founded, and 
this question is never asked. But hun- 
dreds of millions of lives are spared. 

John M. Dedyo 


Yale University, New Haven, CT, USA. 
Email: johnny.dedyo@yale.edu 


10.1126/science.adk8769 
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PHOTOSYNTHESIS 


Edited by Stella-Hurtley 


Algae’s chlorophyll c synthase 


hlorophylls, the key photosynthetic pigments in plants, algae, and cyanobacteria, come in a few different types depending on the 
identity, habitat, and needs of the organism. Most algae use chlorophyll c as an accessory pigment, but the biosynthetic origin and 
physiological importance of this pigment is not well understood. Jiang et a/. used recently developed genetic tools to identify a gene 
coding for a dioxygenase that is involved in producing chlorophyll c. They confirmed the activity of this enzyme with in vitro assays 
and found that algae with mutation in the gene had disrupted pigment levels and impaired photosynthetic activity. -MAF 


Science, adg7921, this issue p. 92 


GPCR SIGNALING 
How neutrophils decide 
on a destination 


Neutrophils are simultane- 
ously exposed to multiple 
chemoattractants that bind to 
Go, protein—activating receptors. 
Lundgren et al. investigated how 
neutrophils rank chemoattrac- 
tants that activate a common 
signaling pathway in human 
neutrophils and a neutrophil cell 
line. Signaling induced by che- 
moattractants typically released 
from sites of infection or damage 
lasted longer than that induced 
by lower-priority chemoattrac- 
tants. The rapid attenuation of 
signaling downstream of a low- 
priority chemoattractant receptor 
required the phosphorylation of 
the C-terminal tail of the receptor. 
—WW 
Sci. Signal. (2023) 
10.1126/scisignal.add1845 


BIOCHEMISTRY 
Radical at the heart 


To synthesize deoxyribonucleo- 
tides, organisms rely on a radical 
reaction catalyzed by a family 
of enzymes called ribonucleo- 
tide reductases. Generating the 
required radical and transfer- 
ring it to the correct residues 
within the enzyme active site is a 
major biochemical undertaking. 
Lebrette et al. have visualized 
the structure of one of the 
radical generation and storage 
subunits of this enzyme with 

an intact radical present in its 
core. The water molecules and 
hydrogen bonds around the 
dihydroxyphenylalanine radical 
are restructured relative to 
states where the radical is lost 
or transported out of the pro- 
tein. Observing this structure 

is only possible using serial 
femtosecond crystallography 
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methods that avoid x-ray dam- 
age entirely. —MAF 
Science, adh8160, this issue p. 109 


ANTHROPOLOGY 
Hidden societies 


Indigenous societies have 

lived in the Amazon for at least 
12,000 years. Finding evidence 
of these societies, however, has 
been greatly hampered by the 


Colored scanning electr 

micrograph (SEM) o 
photosynthetic Phaeodactylum 
tricornutum diatom 


density of the forest in Amazonia. 
Peripato et al. used LIDAR (light 
detection and ranging) surveys to 
identify more than 20 previously 
unidentified developments and 
then modeled the occurrence of 
others across the Amazon. The 
authors predict that between 
10,000 and 24,000 ancient 
earthworks are waiting to be 
discovered. Sampling of some of 
the LIDAR transects revealed a 


LIDAR data reveal earthworks (rings) beneath the forest canopy in Amazonia. 
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consistent set of domesticated 
tree species associated with the 
developments, suggesting active 
forestry practices among these 
societies. -SNV 

Science, ade2541, this issue p. 103 


QUANTUM INFORMATION 
Building a qubit system 
atom by atom 


Qubits implemented using 
solid-state spins are easy to 
manipulate, but such architec- 
tures remain tricky to scale up. 
Wang et al. used a scanning 
tunneling microscope (STM) 
to engineer an atomic-scale 
multi-qubit system consisting 
of asensor qubit and up to two 
remote qubits. The sensor qubit, 
a titanium atom on a bilayer mag- 
nesium oxide film, was directly 
controlled by the STM tip, whereas 
the remote qubits were each 
placed next to an iron atom and 
read out using the sensor qubit. 
The researchers demonstrated 
single- and multi-qubit gates. 
The size and performance of the 
setup may be further improved by 
optimizing the choice of materials 
and spin species. —JS 

Science, ade5050, this issue p. 87 


CORONAVIRUS 
Aseries of 
unfortunate events 


The severe acute respiratory 
syndrome coronavirus 2 (SARS- 
CoV-2) pandemic was marked 

by waves of new strains of virus 
differing in virulence and immune 
reactivity. The advent of each new 
variant of concern brought more 
human casualties and waves of 
onerous quarantine measures. 

To map the evolutionary trajec- 
tory of the variants, Wilks et al. 
obtained sera from people who 
had been vaccinated or infected 
with a range of variants of concern 
and applied antigenic cartography 
to visualize structural changes in 
the virus. The authors observed 
changes in immunodominance 
and immune escape depending 
on the variant that had infected 
the patient or after vaccination. 
Such analysis has implications 
for variant risk assessment and 


SCIENCE science.org 


for selecting the next candidate 
vaccine strains that will confer the 
highest protection. —CA 

Science, adjO070, this issue p. 68 


VACCINES 
An mRNA vaccine 


for mpox 
The recent mpox virus (MPXV) 
outbreak has highlighted the 
need for high-quality and safe 
Orthopoxvirus vaccines. The 
vaccine administered in response 
to the outbreak, a modified vac- 
cinia ankara (MVA), was effective 
at reducing mpox severity and 
transmission; however, a next- 
generation vaccine may have 
performed even better. Freyn et al. 
tested such a candidate vaccine, a 
messenger RNA (mRNA) vaccine 
encoding four highly conserved 
MPXV antigens. The mRNA 
vaccine performed as well as or 
better than an MVA compara- 
tor in terms of eliciting immune 
responses and protecting against 
lethal infection in mice. —CSM 
Sci. Transl. Med. (2023) 
10.1126/scitranslmed.adg3540 


IMMUNOLOGY 


Wash our hands of it? 


The hygiene hypothesis pro- 
poses that early-life exposure 
to microbes protects against 
the development of allergies. 
However, a causal link between 
early-life infections and allergies 
is lacking. To uncover the impact 
of lifelong microbial exposure 
on the development of allergic 
inflammation, Ma et al. compared 
experimental wildling mice, 
which have a representative 
breadth of naturally occur- 
ring pathogens, with specific 
pathogen-free mice. Surprisingly, 
when challenged with allergens, 
wildlings developed robust signs 
of pathological inflammation 
and allergic responses. Thus, 
increased microbial biodiversity 
in wildlings does not protect 
against allergic inflammation, 
in contrast to predictions of the 
hygiene hypothesis. —HMI 
Sci. lmmunol. (2023). 
10.1126/sciimmunol.adf7702 
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Immunofluorescence image 
of cardiac fibroblasts 


CARDIOLOGY 


Edited by Caroline Ash 
and Jesse Smith 


ACHIP off the old heart 


lonal hematopoiesis of indeterminate potential, or CHIP, is 

the buildup of potentially oncogenic mutations in hemato- 

poietic stem cells, and its risk greatly increases with age. It 

is not a malignant condition itself, but it does predispose 

patients to hematological cancers and to multiple types of 
cardiovascular disease, increasing the risk of death. Abplanalp 
et al. and Rauch et al. examined the effects of common CHIP 
mutations on monocytes and macrophages, blood cells that 
play key roles in the process of inflammation. The researchers 
linked the presence of CHIP mutations in these cells to pro- 
inflammatory activity, helping to explain how CHIP contributes 
to both heart failure and atherosclerosis. —YN 


Nat. Cardiovasc. Res. (2023) 


10.1038/s44161-023-00322-x, 10.1038/s44161-023-00326-7 


IMMUNOLOGY 
New roles for an 
immune stalwart 


Group 2 innate lymphoid cells 
(ILC2s) tend to reside in tissues 


where pathogens enter the body, 


and they contribute to immunity 
against parasite infection. Like 
other types of immune cells, 
excessive proinflammatory 
activity of ILC2s can contrib- 
ute to disease. Cui et al. found 
that CD45, a transmembrane 


tyrosine phosphatase that is 
important for the development 
and activation of T cells, plays a 
role in restraining ILC2s. Deletion 
of CD45 in ILC2s increased cell 
numbers and maturation under 
steady-state conditions and 
triggered their hyperactivation 
during immune responses. This 
was associated with exacerbated 
lung inflammation and pathology 
in mouse models of asthma and 
pulmonary fibrosis. The results 
not only show the regulatory 
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MICROBIOLOGY 
Protection from 
friendly fire 


Mammalian cells use guard 
mechanisms to monitor their 
functional pathways for interfer- 
ence by pathogens. Infection 
causes the production of the 
inflammatory cytokine inter- 
feron-y (IFN-y), which triggers 
the expression of hundreds of 
IFN-stimulated-genes, includ- 
ing the kinase PIM1 and GBPI1, a 
membrane-perturbing GTPase. 
Fisch et al. identified a guard 
mechanism whereby PIM1 phos- 
phorylates GBP1 and subjects 
it to sequestration by a 14-3-3 
protein. In human macrophages, 
this mechanism was found to 
prevent GBP1 activity from caus- 
ing Golgi fragmentation and cell 
death. Pathogens can interfere 
with IFN-y signaling and thereby 
potentially escape immune 
detection. However, when this 
signaling is inhibited, short-lived 
PIM1 is degraded, which allows 
GBP1 to control pathogen growth. 
These findings suggest a model 
of IFN-y—dependent protection of 
uninfected bystander cells against 
self-inflicted innate immune dam- 
age. —SMH 

Science, adg2253, this issue p. 67 


MATERIALS SCIENCE 
Moving dislocations 
in diamond 


The speed of sound is often a limit 
of asort on how fast something 
can move through a system. For 
dislocation motions that play 

a large part in plastic deforma- 
tion, the limits are very poorly 
constrained from experiments. 
Katagiri et al. used x-ray radiog- 
raphy of shocked single-crystal 
diamond to track dislocation 
motion during plastic deforma- 
tion (see the Perspective by 
Knudson and Seagle). They found 
that the dislocation speed can be 
higher than the bulk sound speed 
of the material. This observation 
is important for refining deforma- 
tion models at extreme conditions 


SCIENCE science.org 


because these ultrafast disloca- 
tion motions have previously only 
been predicted with theory. —BG 
Science, adh5563, this issue p. 69; 
see also adk4420, p. 37 


HUMAN EVOLUTION 
Expanding the evidence 


Traditionally, researchers 
believed that humans arrived in 
North America around 16,000 
to 13,000 years ago. Recently, 
however, evidence has accumu- 
lated supporting a much earlier 
date. In 2021, fossilized footprints 
from White Sands National Park 
in New Mexico were dated to 
between 20,000 and 23,000 
years ago, providing key evidence 
for earlier occupation, although 
this finding was controversial. 
Pigati et al. returned to the White 
Sands footprints and obtained 
new dates from multiple, 
highly reliable sources (see the 
Perspective by Philippsen). They, 
too, resolved dates of 20,000 to 
23,000 years ago, reconfirming 
that humans were present far 
south of the ice sheets during the 
Last Glacial Maximum. —SNV 
Science, adh5007, this issue p. 73; 
see also adk3075, p. 36 


NEUROSCIENCE 
Preparing for motherhood 


Motherhood leads to pronounced 
behavioral changes in many 
species, such as altered feeding 
routines and increased aggres- 
sivity. How does pregnancy 
prepare females for such future 
behavioral needs? Ammari et 
al. discovered that the hor- 
monal milieu of pregnancy 
remodels a distinct popula- 

tion of hypothalamic neurons 

in mice (see the Perspective 

by McCarthy) that mediates 
the onset of parental behavior 
before giving birth. Sensing of 
estradiol and progesterone by 
galanin-expressing neurons in the 
medial preoptic area is neces- 
sary for this behavioral change. 
Therefore, hormone-mediated 
neuronal modifications lead to 
an increased selectivity for pup 


stimuli, thus anticipating future 
parenting behaviors. —PRS 
Science, adi0576, this issue p. 76; 
see also adk2495, p. 33 


2D MATERIALS 
Landau levels 
in a twisted system 


In a perpendicular magnetic field, 
electrons in two-dimensional 
(2D) materials occupy discrete 
energy levels known as Landau 
levels. These levels are described 
by the so-called Onsager relation. 
Slot et al. used scanning tunnel- 
ing spectroscopy to measure 
the Landau levels in twisted 
double-bilayer graphene with an 
intermediate twist angle of 1.74° 
between the two bilayers. By 
comparing their results against 
the theoretical expectations, the 
researchers were able to measure 
deviations from the standard 
form of the Onsager relation 
stemming from the orbital mag- 
netism in the system. —JS 
Science, adf2040, this issue p. 81 


CATALYSIS 
Oscillations in 
hydrocarbon synthesis 


Reaction temperature and 
product formation oscillate dur- 
ing the hydrogenation of carbon 
monoxide to form hydrocarbons 
(Fischer-Tropsch synthesis) over 
cobalt—ceria catalysts. Zhang et 
al. observed long-duration oscil- 
lations of several degrees Celsius 
with a period of several minutes 
for a1:1 reactant feed over 
cobalt—cerium oxide powder cat- 
alyst operating at 220°C, as well 
as changes in product forma- 
tion and reaction rates (see the 
Perspective by Niemantsverdriet 
and Westrate). A kinetic model 
including a carbon monoxide 
insertion step, thermal activation 
of carbon—oxygen bonds, and 
periodic temperature forcing 
could account for the observa- 
tions. —PDS 

Science, adh8463, this issue p. 99; 

see also adk5831, p.35 
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CELL BIOLOGY 


Exocytosis by membrane crumpling 


xocrine secretion is less well understood than neurotransmitter release. Exocrine secretory 

granules are large organelles, and not only are the mechanisms by which they fuse with the 

plasma membrane poorly known but also how cells maintain plasma membrane homeostasis 

after granule release. Biton et al. used advanced imaging techniques in Drosophila larval sali- 

vary glands to assess the machinery that governs the fusion pore dynamics in large exocrine 
secretory vesicles. Branched actin, myosin ||, and BAR-domain proteins collaborate to control the 
fusion pore, facilitating a distinctive type of exocytosis called membrane crumpling that preserves 
apical membrane homeostasis during secretion. This mechanism allows large volumes of material 
to be secreted without disturbing the delicate balance of the cell’s plasma membrane. —SMH 


J. Cell Biol. (2023) 10.1083/jcb.202302112 


Drosophila salivary gland secretory granules fuse with the cell surface membrane and recruit actomyosin (magenta) 
to extrude the contents (green) by membrane crumpling. 


mechanisms underpinning 
ILC2s, but also reveal a previ- 
ously unknown role for CD45 in 
the immune system. —SHR 
Proc. Natl. Acad. Sci. U.S.A. (2023) 
10.1073/pnas.2215941120 


CATALYSIS 
Cobalt’s situation in 
hydroformylation 


Cobalt carbonyl complexes 

are productive catalysts for 

the preparation of aldehydes 

by hydroformylation. However, 
they have been displaced by 
much more expensive rhodium 
in many applications because of 
their instability. A recent study 


reported that chelating bispho- 
sphine ligands could greatly 
stabilize the cobalt catalysts, 
although a second study failed 
to reproduce those results 
and suggested instead that 
the phosphine-free carbonyl 
complex was more stable than 
previously thought. Hood et al. 
now revisit their original work 
with the chelating bisphosphine 
igands and present multifaceted 
spectroscopic evidence that 
these are indeed active cata- 
ysts. They further propose that 
impure precursors may have 
hindered the prior reproduction 
effort. —JSY 

J.Am. Chem. Soc. (2023) 

10.1021/jacs.3c04866 
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PROTEIN SEQUENCING 
Threading proteins 
through a nanopore 


Nanopores are single-molecule 
biosensors and are widely used 
for ultrasensitive DNA sequenc- 
ing. Uniform negative charges 

of single-stranded DNA allow an 
electric field to be used to move 
the DNA through the pore unidi- 
rectionally; however, proteins are 
chains of amino acids with het- 
erogeneous charge distribution. 
Alternatively, osmotic pressure 
created by a net flux of water 
generated by anet ion charge on 
one side of the pore can be used 


to drive translocation of proteins. 


Sauciuc et al. show that such an 


electroosmotic flow approach 
can drive the transport of both 
unstructured model polypeptides 
and unfolded native proteins. 
—DJ 
Nat. Biotechnol. (2023) 
10.1038/s41587-023-01954-x 


TIME-DOMAIN ASTRONOMY 
Arapid luminous optical 
transient 


The cadences of astronomical 
transient surveys enable the 
identification of unusual sources 
that evolve on time scales of a 
few days. Nicholl et al. report the 
detection of an optical transient 
that is as luminous as a super- 
luminous supernova but evolves 
much more rapidly and has a 
different type of host galaxy. 
The source took 12 days to 
reach peak brightness and fade 
again afterward, which typically 
takes months for supernovae. 
The authors maintain that there 
are only two previously known 
transients with similar properties 
and that these three examples 
constitute a new class of sources. 
No theoretical models match all 
of their properties. —KTS 
Astrophys. J. Lett. (2023) 
10.3847/2041-8213/acfOba 


BIOSYNTHESIS 
B. bonanza 


Vitamin B,, is an extremely 
complex molecule and performs 
some of the most fascinating 
chemistry in all of biology. It is 
an essential part of the animal 
food chain and is produced 
exclusively by microorganisms. 
Kang et al. developed a cell-free 
biosynthetic system to pro- 
duce vitamin B,, consisting of 
four modules and a total of 26 
individual enzymes. In addition 
to the biosynthetic enzymes, a 
complex regeneration scheme 
was used to replenish cofac- 
tors and cosubstrates. The 
amount of B,, produced in this 
way approaches what can be 
achieved with microbial fermen- 
tation, and this system should be 
amenable to further optimiza- 
tion and customization. —MAF 
Nat. Commun. (2023) 
10.1038/s41467-023-40932-4 


science.org SCIENCE 


RESEARCH 


RESEARCH ARTICLE SUMMARY 


MICROBIOLOGY 


PIM1 controls GBP1 activity to limit self-damage 
and to guard against pathogen infection 


Daniel Fisch, Moritz M. Pfleiderer, Eleni Anastasakou, Gillian M. Mackie, Fabian Wendt, Xiangyang Liu, 
Barbara Clough, Samuel Lara-Reyna, Vesela Encheva, Ambrosius P. Snijders, Hironori Bando, 
Masahiro Yamamoto, Andrew D. Beggs, Jason Mercer, Avinash R. Shenoy, Bernd Wollscheid, 
Kendle M. Maslowski, Wojtek P. Galej, Eva-Maria Frickel* 


INTRODUCTION: Cells in infected tissues are ex- 
posed to inflammatory stimuli, including the 
innate and adaptive immunity-stimulating 
cytokine interferon-y (IFN-y). Although most 
tissue-resident and infiltrating cells are not 
infected, when exposed to IFN-y, these bystander 
cells preemptively express a repertoire of 
interferon-stimulated genes (ISGs) with robust 
antimicrobial activities and the potential for 
self-harm. ISGs of the guanylate-binding pro- 
tein (GBP) family are large, membrane-active 
guanosine triphosphatases (GTPases). GBPs 
can control intracellular microbes in various 
ways, most importantly by promoting mem- 
brane rupture and the release of microbial 
ligands and by the induction of programmed 
cell death, including pyroptosis and apop- 
tosis. How uninfected cells protect them- 
selves from the potentially self-destructive 
actions of GBPs while keeping these proteins 
readily available to combat infection is 
unknown. 


Uninfected bystander cells 
IFN-y 


) &.g., NSC756093, 


RATIONALE: Cells need to tightly control the ac- 
tivity of antimicrobial proteins but rapidly de- 
ploy them upon infection. How is this achieved 
in human cells? Posttranslational modifications, 
such as phosphorylation by protein kinases, en- 
able rapid and precise control of protein ac- 
tivities. We studied the phosphorylation of 
GBPI, a typical ISG, and how this modification 
affects its function, activity, and localization in 
human macrophages. 


RESULTS: Ectopic expression of GBP1 in human 
macrophages led to changes in cell morphology, 
GBP1 accumulation at the Golgi apparatus, Golgi 
fragmentation, and uncontrolled cellular ne- 
crosis. These findings illustrate GBP1’s poten- 
tial to inflict self-damage. This phenotype was 
mitigated by IFN-y treatment, suggesting that 
another IFN-y-inducible factor limited GBP1 
activity. We identified the kinase PIM1 as being 
this factor. We generated a phosphorylation- 
specific antibody and used high-resolution 
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The protein kinase PIM1 controls GBP1 activity. In uninfected bystander cells, IFN-y signaling induces 
expression of PIM1, which phosphorylates GBP1, subjecting it to sequestration by 14-3-3o. This shunts 
aberrant GBP1 activity and ensures cell survival (left). Perturbation of this homeostasis leads to GBP1 
dephosphorylation and Golgi targeting and disruption, culminating in necrosis (middle). Intracellular 
microbes that disrupt IFN-y signaling, e.g., T. gondii, deplete PIM1 and liberate GBP1, which then attacks 


and controls the infection (right). 
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nm 
mass spectrometry to demonstrate GBP1 p Che: 
phorylation at serine-156 (Ser), which |." 
guided by a basophilic PIM1 recognition motif. 
Ser’®® is the central residue of a 14-3-3 protein 
binding motif, which suggests a switch-like 
function for its phosphorylation. Indeed, 14-3-3 
proteins, especially 14-3-30, interacted with 
phosphorylated GBPI1. In vitro reconstitution 
of this complex followed by single-particle 
cryo-electron microscopy confirmed a 14-3-30 
dimer grabbing onto the GBP1 GTPase do- 
main. This binding locked GBP1 in a GTPase- 
inactive, monomeric state and restrained its 
activity in the macrophage cytosol. Expressing 
phosphorylation-deficient GBP1 mutants or 
mutants that could not be recognized by the 
kinase PIM1 or bound by 14-3-3o led to un- 
controlled GBP1 activation and subsequent 
cell death. Genetic depletion of either PIM1 or 
14-3-30 had similar outcomes, as did treat- 
ment with the GBP1:PIM1 interaction inhibi- 
tor NSC756093. Using the inhibitor in IFN-y- 
activated patient-derived tumor organoids 
increased organoid death and prevented or- 
ganoid reformation. Thus, we found that PIM1 
and 14-3-30 together controlled the activity 
of GBP1 in human cells. Disrupting PIM1- 
driven control of GBP1 has potential thera- 
peutic implications for cancer therapy and 
innate immunity. 

We observed that PIM1 mRNA and protein 
were extremely short-lived. Infection with the 
apicomplexan parasite Toxoplasma gondii, a 
pathogen that resides within intracellular vacu- 
oles and blocks IFN-y signaling by means of the 
effector protein TgIST, led to fast depletion of 
PIMI. This in turn reduced GBP1 Ser’”® phos- 
phorylation levels and liberated GBP1 from 
14-3-30 sequestration. High-throughput im- 
aging revealed that GBP1 then rapidly targeted 
Toxoplasma-containing vacuoles to improve 
control of the infection. 


CONCLUSION: The IFN-y-induced, short-lived 
kinase PIM1 guards the integrity of IFN-y sig- 
naling and protects self-membranes by regu- 
lating the activity of the potent antimicrobial 
effector GBP1. Pathogens that block IFN-y sig- 
naling, thereby reducing the levels of PIM1, 
are then exposed to GBP1-driven innate im- 
mune control. The phosphoregulation of 
GBP! by PIMI reveals an IFN-y-dependent 
control mechanism that protects uninfected 
bystander cells from self-inflicted innate im- 
mune damage during the process of pathogen 
elimination. 
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PIM1 controls GBP1 activity to limit self-damage 
and to guard against pathogen infection 


Daniel Fisch'”, Moritz M. Pfleiderer®, Eleni Anastasakou®, Gillian M. Mackie*, Fabian Wendt®°, 
Xiangyang Liu’, Barbara Clough”, Samuel Lara-Reyna®, Vesela Encheva’, Ambrosius P. Snijders”®, 
Hironori Bando?”°, Masahiro Yamamoto®”°, Andrew D. Beggs”, Jason Mercer’, 

Avinash R. Shenoy’, Bernd Wollscheid>*, Kendle M. Maslowski*?47>"6, 


Wojtek P. Galej°, Eva-Maria Frickel** 


Disruption of cellular activities by pathogen virulence factors can trigger innate immune responses. 
Interferon-y (IFN-y)-inducible antimicrobial factors, such as the guanylate binding proteins (GBPs), 
promote cell-intrinsic defense by attacking intracellular pathogens and by inducing programmed 

cell death. Working in human macrophages, we discovered that GBP1 expression in the absence of 
IFN-y killed the cells and induced Golgi fragmentation. IFN-y exposure improved macrophage survival 
through the activity of the kinase PIM1. PIM1 phosphorylated GBP1, leading to its sequestration by 
14-3-36, which thereby prevented GBP1 membrane association. During Toxoplasma gondii infection, the 
virulence protein TgIST interfered with IFN-y signaling and depleted PIM1, thereby increasing GBP1 
activity. Although infected cells can restrain pathogens in a GBP1-dependent manner, this mechanism 
can protect uninfected bystander cells. Thus, PIM1 can provide a bait for pathogen virulence factors, 


guarding the integrity of IFN-y signaling. 


nterferon-y (IFN-y) is a pivotal cytokine 
that promotes innate and adaptive immu- 
nity during infection. IFN-y acts on all nu- 
cleated cells to induce expression of a large 
repertoire of interferon-stimulated genes 
(ISGs) with potent antimicrobial activities (1). 
Pathogens are targeted by antimicrobial path- 
ways after their detection by pattern recognition 
receptors (PRRs) (pattern-triggered immunity) 
or when they disrupt a regulator of an immune 
response (effector-triggered immunity). The 
latter mechanism, also referred to as guard 
immunity, was originally described in plants 
and suggests that innate immunity can indi- 
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rectly detect virulence factors by their activ- 
ities (2). The complexity of guard immunity in 
mammalian systems remains largely unex- 
plored [reviewed in (3)]. Whether guard im- 
munity exists in the context of inflammatory 
type II interferon production and signaling is 
unknown. 

Among the ISGs are large guanosine triphos- 
phatases (GTPases) of the GBP family (4). GBPs 
restrict infection by targeting intracellular mi- 
crobes, thereby promoting their rupture and 
the release of microbial ligands; by redirecting 
autophagy and oxidative machinery to pathogen- 
containing compartments; and by removing rep- 
licative niches through induction of pyroptosis 
or apoptosis (5-15). IFN-y up-regulates antimi- 
crobial genes, including GBPs, in the primary 
infected cells and in neighboring, uninfected 
bystander cells. IFN-y thus induces ubiquitous 
expression of GBPs in most cell types. Among 
GBPs, GBP1 is the most abundant in IFN-y- 
primed cells. GBP1 has a C-terminal prenylation 
site that enables its membrane association 
(6) and forms homo- and heterodimers and 
membrane-bound coatomers with itself or with 
other GBPs (17, 18). The GBP1 GTPase converts 
guanosine triphosphate (GTP) to guanosine 
monophosphate (GMP) in two steps (19). GBP1 
can target membranes of the host Golgi appa- 
ratus (16, 20) as well as plasma membrane- 
derived vacuoles containing Toxoplasma gondii 
(Tg) (10, 11) or lipopolysaccharide (LPS) on cy- 
tosolic bacteria (7, 11, 13-15). However, how 
uninfected cells protect themselves from the 
potentially destructive actions of GBP1 remains 
unclear. 
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IFN-y and phosphorylation control GBP1 
activity and prevent cytotoxicity 

To investigate how the activities of GBP! are reg- 
ulated in uninfected cells, we used a doxycycline 
(Dox)-inducible expression system in phorbol 
myristate acetate (PMA)-differentiated THP-1 
macrophages (THP-IAGBPI+GBPI) (11). We 
first measured the survival of cells that ectopi- 
cally expressed GBP1 in the absence of expo- 
sure to IFN-y. Dox-induced ectopic expression 
of GBP1 was cytotoxic, progressively leading to 
~75% cell loss over 7 days (Fig. 1A). Exposure of 
GBP1-expressing cells to IFN-y reduced cell 
loss by ~55% (Fig. 1A). Treatment with IFN-y 
in the absence of ectopic GBP1 expression re- 
sulted in minimal cell loss (Fig. 1A). Thus, we 
surmised that an unknown IFN-y-dependent 
process and/or protein(s) must block GBP1- 
driven toxicity. 

GBP1 contains three putative, surface-exposed 
phosphorylation sites: Ser’®, Ser’®’, and Tyr*?” 
(Fig. 1B). Immunoprecipitated Flag-GBP1 from 
macrophages probed for phosphorylated ser- 
ine residues confirmed the presence of a phos- 
phorylated GBP1 species, the amount of which 
increased upon exposure to IFN-y (Fig. 1C). We 
hypothesized that IFN-y-mediated phosphoryl- 
ation of GBP1 could be a regulatory switch that 
suppresses the toxicity of GBP1. To test this 
possibility, we studied phosphorylation-deficient 
mutants of GBP1 [S156A (Ser’’°—Ala), SI57A 
(Ser®””— Ala), Y427F (Tyr*?’”—Phe)] expressed in 
AGBPI cells (fig. S1). Whereas S157A and Y427F 
had no effect on the dispersed, granular appear- 
ance of GBP1 within the cytosol, GBP15™ ac- 
cumulated in perinuclear areas (fig. S2A), which 
were revealed to be the Golgi apparatus by im- 
munofluorescence assays (fig. S2B). Further- 
more, GBP1*'°*-expressing cells were more 
elongated than cells expressing wild-type (WT) 
GBPI (fig. S2C) and displayed changes in their 
cortical actin cytoskeleton (fig. S2D). Phospho- 
rylation of GBP1 Ser’®* might thus control lo- 
calization and activities of GBP1. 

To test whether GBP1 was phosphorylated at 
Ser””® specifically, we raised a phosphospecific 
antibody and validated its specificity with the 
following results: First, the antibody specifically 
interacted with phosphorylated GBP1 peptide 
(epitope: amino acids Ie’ to Cys’); second, it 
bound to immunoprecipitated Flag-GBP1 WT 
but not Flag-GBP1S"™; third, with the exception 
of minor reactivity toward GBP3, the antibody 
showed no cross-reactivity with other GBPs; 
and fourth, it specifically stained mCherry-tagged 
GBP1 (mCH-GBP1) but not mCH-GBP1S™ in 
immunofluorescence assays (fig. S3, A to D). 
Using this antibody, we confirmed that GBP1 
was phosphorylated at Ser’®* upon exposure of 
cells to IFN-y by immunoprecipitation and 
immunoblot assays (Fig. 1D). Expression of 
GBP15°™ was more toxic than that of WT GBP1, 
and IFN-y treatment no longer suppressed the 
toxicity of GBP1S*™ (Fig. 1E). We thus checked 
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macrophages that expressed WT GBP1 or 
GBP15° for markers of regulated cell death 
(fig. S4A) but did not observe any hallmarks 
of apoptosis, necroptosis, or pyroptosis after 
2,4, or 7 days (fig. S4B). Chemical inhibition 
of regulated cell-death pathways failed to 
prevent cells from dying over 4 days of con- 
stitutive ectopic expression of GBP! (fig. S4C). 
Again, addition of IFN-y rescued cells that 
expressed WT GBP1 but not GBP1S”™* and 
minimally affected cell viability by itself (fig. 
$4C). Thus, GBP1 phosphorylation on Ser’”® 
was required to prevent unwanted toxicity 
and unregulated, necrotic death of human 
macrophages. 

We next asked whether phosphorylation of 
GBP1 on Ser’ affects its known activities, such 
as oligomerization (14, 17, 18) and membrane 
association (16). Immunoblots showed that 
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cytosolic WT GBP1 was monomeric, whereas 
GBP15* dimers were readily detected, which 
is suggestive of increased GBP1 activity (Fig. 
1F). Subcellular fractionation showed increased 
membrane association of GBP15*“ (Fig. 1G). 
Because GBP15 also exhibited increased 
Golgi targeting (fig. S2B), we next assessed 
Golgi morphology after 2 days of continuous 
GBPI1 expression. We observed increased frag- 
mentation of the Golgi in cells that expressed 
GBP15%* as compared with cells that expressed 
WT GBPI; the cells that expressed GBPIS*® 
showed ~2.5 times more fragments per cell 
with a decreased average fragment size (Fig. 
1H). Taken together, the phosphodeficient 
GBP1°° mutant showed increased associa- 
tion with and disruption of the Golgi, which 
was accompanied by unregulated cellular ne- 
crosis. Thus, Ser'® phosphorylation suppresses 
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smaller images on the right are zoomed-in views of the regions outlined by 

the white rectangles. Scale bars are 20 um. Green, GM-130 (Golgi); magenta, 
mCH-GBPI,; blue, nuclei. (I) Graphic summarizing findings on GBP1 activity 
control. Data information: Images in (C) and (D) and (F) to (H) are representative 
of n = 3 experiments. Graphs in (A), (E), (G), and (H) show mean + SEM from 

n = 3 experiments. ***P < 0.001 and ****P < 0.0001 for the indicated comparisons 
in (A) and (E) from one-way analysis of variance (ANOVA) and in (H) from unpaired 
Student's t tests; ns is not significant. For gel source data, see data SO. 


GBP! activities and protects cells from self- 
inflicted damage (Fig. 11). 


The kinase PIM1 phosphorylates GBP1 at Ser’®* 


To narrow down the number of kinases that 
are capable of phosphorylating GBP1 on Ser’”®, 
we first used in silico analysis, which identi- 
fied nine candidate protein kinases (fig. S5A). 
For each of these kinases, we confirmed expres- 
sion in macrophages by reverse transcription 
quantitative polymerase chain reaction (RT- 
qPCR) and reduced their expression with small 
interfering RNA (siRNA) silencing (figs. S5B 
and S6). Next, we immunoprecipitated Flag- 
GBP1 from THP-1 cells transfected with siRNAs 
directed against the kinases or against pro- 
tein phosphatase 1/2 catalytic subunit alpha 
(PPPICA, PPP2CA; Fig. 2A). Quantification 
of phosphoproteins suggested a reduction in 
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total phosphorylation of GBP1 when PIM-family 
kinases were silenced, whereas silencing of 
PPPICA or PPP2CA increased its total phos- 
phorylation levels (Fig. 2A). Immunoblots using 
the Ser’ phosphospecific antibody showed that 
loss of PIM1 most strongly reduced phosphoryl- 
ation of GBP1 at this residue (Fig. 2A). We next 
tested whether PIM family kinases could di- 
rectly phosphorylate GBP1. Indeed, recombi- 
nant PIM1, 2, or 3 all phosphorylated GBP1 to 
varying degrees in vitro (Fig. 2B). Furthermore, 
we confirmed a previously observed direct in- 
teraction between PIM1 and GBP! (27) in IFN-y- 
exposed THP-1 cells after cross-linking and 
immunoprecipitation. The PIM1:GBP1 inter- 
action inhibitor NSC756093 abolished this 
interaction (Fig. 2C). Next, we generated THP- 
1APIMI1 CRISPR-Cas9 knockout cells and con- 
firmed the knockout of PIM1 by immunoblotting 
and RT-qPCR (fig. S7A). Endogenous GBP1 was 
not phosphorylated at Ser’? in IFN-y-exposed 
APIM1 cells, and phosphorylation was restored 
upon reconstitution with PIM1 WT but not 
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with a PIM1 kinase-dead mutant [PIM1°®5 (22); 
Fig. 2D and fig. S7B]. Immunoprecipitation of 
endogenous GBP1 from IFN-y-exposed THP-1 
WT cells with the phosphospecific antibody 
showed that ~40% of the cellular GBP1 pool 
was phosphorylated (Fig. 2E). 

PIM1, 2, and 3 are >65% similar to each other 
but vary in their efficiency of GBP1 phosphoryl- 
ation (Fig. 2, A and B). To assess whether PIM2 
and 3 also play a role in cellulo, we compared 
their expression levels in naive and IFN-y-treated 
macrophages by RT-qPCR and immunoblot- 
ting. Only PIM1 showed IFN-y inducibility 
(fig. S8A), confirming previous findings (23). 
Knockout of PIM1 did not influence the ki- 
netics or magnitude of GBP1 expression (fig. 
S8B), but endogenous GBP1 was no longer 
phosphorylated at Ser’®° in APIM1 cells (Fig. 
2D and fig. S8C). Additional depletion of PIM2 
or PIM3 in APIM] cells had no influence on 
phosphorylation of GBP1 as determined by 
immunoblotting (fig. S8C). However, immu- 
noprecipitation and ProQ stain of endoge- 
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liquid chromatography-MS/MS; n.d., not detected. Data information: Images in (A) 
to (E) are representative of n = 3 experiments. Graphs in (A) show mean + SD from 
n = 3 experiments, and graphs in (F) show mean + SD from n = 3 replicates. 

*P < 0.05, **P < 0.01, ***P < 0.001, and ****P < 0.0001 in (A) from one-way 
ANOVA comparing all conditions with siRNA CTRL-transfected cells and in (F) from 
unpaired two sample t test after adjustment for multiple comparisons. For gel 
source data, see data S5. 


nous GBP1 from APIM] cells showed that the 
protein remained phosphorylated at residues 
other than Ser’”® (fig. S8D). 

Because PIM1 showed a prominent effect on 
the phosphorylation of GBP1 and is a known 
GBP1 interactor in cancer cells, we performed 
additional experiments with recombinant PIM1 
and GBP1 to explore their interaction. These ex- 
periments showed a concentration-dependent 
increase in the phosphorylation of GBP1 by 
PIM1I, which was abolished by inclusion of 
NSC756093, corroborating a requirement for 
direct interaction between the two proteins 
(fig. SSE). High-resolution mass spectrometry 
(MS) enabled dissection of the PIM1 kinase- 
GBP1 substrate relationship and identified 
three phosphorylation sites (Ser’”®, Ser®®, and 
Thr”; Fig. 2F). Fragment peak annotation 
of the electron-transfer dissociation (ETD)- 
activated tandem MS (MS/MS) spectra for 
the corresponding monophosphorylated pep- 
tides unequivocally confirmed phosphoryl- 
ation at Ser’ and Thr°”° (MaxQuant PTM site 
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localization probability >0.9999; fig. S8F and 
data S1). Ser’? phosphorylation increased by 14- 
fold upon the addition of adenosine triphos- 
phate (ATP), firmly establishing PIM1 as a 
bona fide GBP1 kinase. 

We next sought to identify the PIM1 recog- 
nition sequence within GBP1. Evolutionary 
analysis of >3300 eukaryotic GBP genes iden- 
tified the closest GBP1 homologs in 484 species 
(fig. S9A and data S2). Most of the analyzed 
organisms also had a well-conserved PIM1 ho- 
molog (data S2). The GBP1 homologs were 
analyzed for the presence of the corresponding 
phosphoserine, and the amino acid sequence 
upstream of the putative phosphoserine residue 
was compared with previously reported PIM1 
recognition motifs (24-27). From this, it ap- 
peared that a basophilic kinase substrate motif 
(R’"XRXKS”"* where R is Arg, X is any residue, 
K is Lys, and S is Ser) was required for recog- 
nition of GBP1 by PIM1 (fig. S9A). The PIM1 motif 
and the putative phosphorylation site (residue 
equivalent to Ser’”®) were well conserved across 
evolution, and their appearance in GBP se- 
quences correlated with each other (fig. S9A). 
We used a series of GBP1 mutants (fig. S1) to 
assess the functional role of this motif in guid- 
ing Ser’° phosphorylation. We observed a com- 
plete loss of Ser’*® phosphorylation in K155A 
or R153A mutants and reduced phosphorylation 
for RI51A (fig. SOB). Because Lys” and Arg” lie 
within the phosphospecific antibody epitope, 
their mutation could impair antibody binding. 
We thus used phosphoprotein-specific staining to 
assess phosphorylation independently. GBPS“ 
had residual phosphoprotein-specific staining 
levels similar to that of GBP1S'°°, whereas 
GBP1**4 or GBP1*“ showed only a ~40% re- 
duction (fig. S9B). Similarly, GBPIRe“Y/®5*4 and 
all mutants including K155A displayed reduced 
levels of phosphorylation, which were similar 
to the residual level of S156A (fig. SOB). Thus, 
the PIM1 phosphorylation motif in GBP1 is a 
basophilic kinase substrate motif. 

Having established the GBP1 recognition 
motif for PIM1, we next assessed the other hu- 
man GBP family members (fig. S9C). From 
this, it appeared that only GBP3, which is ex- 
pressed at low levels in human macrophages 
(11), possesses a functional motif. All other 
GBPs have disruptive amino acid replacements 
in the recognition motif (GBP2, 4, 6, and 7) 
or a mutated phosphorylation acceptor site 
(GBP5; fig. S9C). We next tested the impor- 
tance of PIM1 on cellular survival. In APIM7 
cells, IFN-y stimulation alone was sufficient to 
induce gradual cell loss over 7 days (fig. S9D), 
which resembled ectopic expression of WT 
GBP1 in naive cells (Fig. 1A). Notably, the si- 
lencing of GBP1, but no other GBP, prevented 
viability loss, indicating that in the absence 
of PIM1, GBP1 was the only GBP activated in 
an uncontrolled manner (fig. S9D). IFN-y- 
inducible PIM1 is thus directly responsible 
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for phosphorylation of GBP1 on Ser!°, which 
prevents damage to cells. 


14-3-3o-bound phospho-GBP1 is inactive 


Ser’”® of GBP1 is part of a flexible, surface-exposed 
loop and is the central serine of a putative, evo- 
lutionarily conserved mode I 14-3-3 binding 
motif (figs. S9A and S10A). MS analysis of 
GBP1-interacting proteins in IFN-y-treated 
human macrophages showed the presence of 
all 14-3-3 proteins, including significant inter- 
actions with 14-3-38, C, and o (Fig. 3A and data 
$3), as expected (28). Immunoprecipitation of 
endogenous GBP1 confirmed its interaction 
with 14-3-3 proteins (pan-14-3-3 antibody) and 
with 14-3-30 specifically (Fig. 3B). Immunopre- 
cipitation of co-transfected Flag-GBP1 and 
hemagglutinin (HA)-tagged 14-3-3 proteins 
(B, y, €, ¢, n, 8, and o) showed promiscuous 
interaction of all 14-3-3 proteins with GBP1 
WT but not with GBPS!“ or GBPIRE34/ P84 
carrying a mutated 14-3-3 binding motif (fig. 
S10B). The GBP1"°*™ mutation did not affect 
its binding to 14-3-3 proteins (fig. SIOB). We 
thus further examined the potential roles of 
Ser’”° phosphorylation. The mRNA levels for 
the seven 14-3-3 proteins varied in human mac- 
rophages, with the abundance of 14-3-30 being 
the lowest in naive cells and showing an in- 
crease upon IFN-y treatment (figs. S6 and S10C). 
Expression of the other 14-3-3 family members 
remained unchanged (fig. S10C). Semiquanti- 
tative coimmunoprecipitations of phosphoryl- 
ated endogenous GBP1 and purified HA-tagged 
14-3-3 proteins corroborated our findings and 
showed the highest binding affinity between 
GBP1 and 14-3-3o [dissociation constant (Kg) = 
0.37 + 0.10 uM (SD); fig. S11]. This was inde- 
pendently confirmed by isothermal titration 
calorimetry (ITC) with recombinant 14-3-36 
and in vitro phosphorylated recombinant GBP1 
(Fig. 3C and fig. S12). Unphosphorylated GBP1 
did not bind 14-3-30 (Fig. 3C), confirming its 
specific interaction with the phosphorylated 
protein. 

We next reconstituted the GBP1-PIM1-14- 
3-30 reaction in vitro and found that these 
proteins formed a ~125-kDa complex in the 
presence of ATP, most likely composed of one 
monomer of phosphorylated GBP1 (67 kDa) 
and a 14-3-30 dimer (dimer: 56 kDa) (fig. S13A). 
The stable phospho-GBP1:14-3-30 complex could 
be purified and analyzed by size exclusion 
chromatography (fig. S13B). Dimerization of 
GBP1 occurred in absence of GTP but increased 
markedly in the presence of GIP or GDP plus 
AIF;. The latter locks GBP1 in a GDP-bound 
state, showing that conformational changes 
in the GBP1 GTPase domain are required for its 
dimerization (fig. S13C). Phosphorylated and 
14-3-30-bound GBPI1 did not dimerize (fig. 
$13C). Furthermore, the rate of GTP hydrol- 
ysis by GBP1 in a GBP1:14-3-30 complex was 
reduced by >8-fold as compared with hydrol- 
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ysis by free GBP1 (Fig. 3D). Phosphorylation of 
GBPI1 without 14-3-30 binding did not influence 
the rate of GTP hydrolysis (Fig. 3D). 

To visualize the interaction between phos- 
phorylated GBP1 and 14-3-30, we solved the 
structure of the purified protein complex (fig. 
S13B) using single-particle cryo-electron mi- 
croscopy (cryo-EM; Fig. 3E and fig. S14 and 
table S1 for details). We obtained a cryo-EM 
map of the GBP1:14-3-3o heterotrimer at 5-A 
resolution, which confirmed that one 14-3-36 
homodimer bound to a single copy of GBP1 by 
grabbing its GTPase domain and interacting 
with the linker helix that connects the helical 
domain to the GTPase domain (Fig. 3E). Al- 
though the overall resolution did not permit 
de novo modeling, we were able to unambigu- 
ously dock previously reported crystal structures 
as rigid bodies based on secondary-structure 
features (Fig. 3E). Close interactions were ob- 
served for GBP! loop Arg’*’ to Thr’ with the 
termini of helices 8 and 9 of 14-3-30 copy 1. Fur- 
thermore, 14-3-3c helices 5 and 9 were in con- 
tact with the parallel-running linker helix of 
GBP1 (Fig. 3E). In this position, the phosphate 
binding site of 14-3-30 copy 2 was in close 
proximity to GBP1 Thr®”°, for which we also 
observed phosphorylation (Fig. 2F) but no ef- 
fect on 14-3-3 binding upon mutation of GBP1 
Thr”? (fig. SIOB). Thus, we hypothesize that 
14-3-30 binding to GBP1 functions in a similar 
way as 14-3-3 sequestration of Rnd3 GTPase 
(29), by requiring two phosphorylation sites 
and the farnesyl moiety. For GBP1, both Ser’®® 
and Thr”? would be involved in sequential 
two-step 14-3-3-binding, with the initial recog- 
nition by Ser’’® and subsequent stabilization 
by binding to Thr’. In summary, our model 
suggests that phosphorylated GBP1 is seques- 
tered by 14-3-30, which acts as a molecular 
padlock to keep the protein inactive through 
inhibition of dimerization and GTPase activ- 


ity (Fig. 3F). 
PIM1 and 14-3-3o control GBP1 activities 


Having identified a role for PIM1 and 14-3-30 in 
inhibiting the activity of GBP1, we next asked 
whether PIM1 and 14-3-3o protected cells from 
the toxic effects of GBP1. We used CRISPR-Cas9 
to delete 14-3-36 (encoded by the gene SFN, 
also called YWHAS; fig. S7A) and reconstituted 
the cells with Dox-inducible 14-3-36 (fig. S7C). 
Next, we measured the long-term survival of 
cells forced to express GBP1 mutants that can- 
not be recognized and inactivated by PIM1 
(GBPIREYYRISBA/KISSA) OF 143-36 (GBP AY PRAY, 
Similar to WT GBP! (Fig. 1A), ectopic expres- 
sion of these GBP1 mutants led to complete cell 
loss, pointing to their toxicity (Fig. 4A). Similar 
to phosphorylation-deficient Gppree* (Fig. 1E), 
IFN-y treatment was unable to prevent cell death 
(Fig. 4A). We then measured the long-term sur- 
vival of WT, AGBPI, APIM1, APIM1/GBP1, Al4- 
3-36, and A14-3-30/GBP1 cells. IFN-y treatment 
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of APIMI cells led to cell loss, as previously ob- 
served (fig. S9D), as did IFN-y treatment of A74- 
3-36 cells (Fig. 4B). In both cases, cell loss was 
prevented by additional knockout of GBPI1, 
whereas IFN-y treatment of THP-1 WT or AGBPI 
cells did not reduce viability (Fig. 4B). More- 
over, both GBP1 mutants and endogenous 
GBP1 in APIM]7 or AJ4-3-36 cells were prone 
to dimerization (Fig. 4C) and localization to 
cellular membranes in IFN-y-exposed mac- 
rophages (Fig. 4D). Again, this resembled the 
phosphorylation-deficient S156A mutant (Fig. 1, 
F and G). 

To test whether this mechanism holds true 
at the tissue level, we used IFN-y-responsive, 
patient-derived colorectal tumor organoids 
and tested whether the GBP1:PIM1 interac- 
tion inhibitor NSC756093 affected cell via- 
bility, organoid growth, and stemness. IFN-y 
treatment alone had no impact on organoid 
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survival or growth, and the inhibitor NSC756093 
alone only caused mild cytotoxicity (Fig. 4E). 
However, treatment with a combination of 
IFN-y and inhibitor (thereby unleashing GBP1 
from its phosphorylation-driven control) de- 
creased organoid growth by ~60% over the 
course of 4 days (Fig. 4E). We confirmed these 
findings by visual inspection, observing fewer 
and smaller organoids upon treatment with 
IFN-y and NSC756093 compared with untreated 
or IFN-y- or inhibitor-only treated samples. 
(Fig. 4F). To further interrogate the effects of 
treatment with IFN-y and NSC756093, we next 
evaluated the stemness of organoids using stem- 
forming assays. To do so, organoids pretreated 
with IFN-y and NSC756093 for 48 hours were 
dissociated, reseeded, and left to grow back 
into organoids for 7 days without any addi- 
tional treatment (Fig. 4, G and H). This re- 
vealed a lasting effect of IFN-y and NSC756093 
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on the ability of organoid precursors to self- 
renew and illustrates the potency of GBP1 in 
colorectal tumor organoids (Fig. 4, G and H). 
In summary, PIM1 and 14-3-30 are required 
to control the activity of GBP1 in human cells 
by phosphorylation and cytosolic sequestration. 


Toxoplasma infection depletes PIM1 
and activates GBP1 


We hypothesized that in the absence of phos- 
phorylation of GBP1 by PIM1 and its sequestra- 
tion by 14-3-30, GBP1 is free to target pathogen 
compartments more effectively. To test how 
control of GBP1 affects its trafficking and anti- 
microbial activities, we quantified its ability to 
restrict growth of Tg. We tested cells that ex- 
pressed GBP1 mutants with impaired phospho- 
rylation (S156A and R151A/R153A/K155A) or 
binding of 14-3-30 (R153A/P158A), cells that lacked 
PIMI (or expressing kinase-dead PIM1°®"), and 
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cells that lacked 14-3-30. All displayed increased 
recruitment of GBP1 to Tg vacuoles, restricted 
the growth of Tg more efficiently, and showed 
increased disruption of Tg-containing vacu- 
oles and parasites, with a subsequent higher 
induction of apoptosis as compared with IFN-y- 
treated THP-1 WT controls (Fig. 5A). These re- 
sults indicate increased parasiticidal activity of 
GBP1 in conditions that prevent GBP1 phos- 
phorylation and/or its interaction with 14-3- 
30. Reconstituting knockout cells with either 
WT PIM1 or 14-3-30 decreased all pathogen con- 
trol measures back to THP-1 WT levels (Fig. 5A). 

When assessing the dynamics of the asso- 
ciation of GBP1 with Tg-containing vacuoles, 
GBP1 was recruited faster and to higher levels 
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data, see data S5. 


in both APIM7 and NSC756093-treated THP-1 
WT cells (fig. S15, A and B). Fluorescence recov- 
ery after photobleaching assays (FRAP) showed 
increased mobility of mCH-tagged GBP1 in APIM1 
cells or cells that express mCH-GBP1S, which 
could explain the faster kinetics of GBP1 recruit- 
ment to Tg vacuoles (fig. S15C). Conversely, the 
silencing of PPPICA impaired the growth con- 
trol of Tg, whereas treatment with NSC756093 
improved it. Depletion of PIM2 or PIM3 affected 
neither GBP1 recruitment to Tg vacuoles nor 
growth control of Tg (fig. S15, D to F). 

We next sought to determine how GBP! phos- 
phorylation and protein abundance changed in 
Tg-infected cells. We first measured the induc- 
tion of GBP1, 14-3-30, and PIM1 as well as the 
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are representative of n = 3 experiments. Graphs in (A), (B), (D), (E), and 

(G) show mean + SEM from n = 3 experiments, normalized to untreated cells. 
*P < 0.05, ***P < 0.001, and ****P < 0.0001 in (A), (B), and (E) comparing 
all conditions with WT or untreated cells and for indicated comparisons 

in (E) and (G) from one-way ANOVA; ns is not significant. For gel source 


appearance of Ser’®°°-phosphorylated GBP1 in 


THP-1 WT cells after IFN-y treatment (Fig. 5B). 
This time course showed a more rapid induc- 
tion of PIM1 than GBP1. Contrary to our RT- 
qPCR data (fig. S10C), these results showed that 
the levels of 14-3-30 did not change (Fig. 5B). 
Notably, GBP1 appeared to be phosphorylated 
soon after it was expressed (Fig. 5B), indicating 
that the PIM1 and 14-3-30-mediated control of 
GBP1 was immediately active. PIM1 protein 
and mRNA were unstable and short-lived [half- 
life (t1/2) ~8.5 or ~22 min, respectively] as com- 
pared with the more-stable GBP1 (protein ty. > 
12 hours; Fig. 5, C and D). This led us to hy- 
pothesize that PIM1 degradation might serve to 
uncouple the inhibition of GBP1. In this way, 
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Fig. 5. Toxoplasma infection 
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(F), and (H) are representative of n = 3 experiments. Graphs in (A) show 


IFN-y-induced PIMI serves a dual role: It func- 
tions as a guard that detects pathogen-mediated 
interference of IFN-y signaling (leading to de- 
pletion of PIM1), and it protects self-membranes 
from damage inflicted by GBP1. Tg secretes a 
pathogen effector protein called T. gondii in- 
hibitor of STATI transcriptional activity (TgIST). 
Upon entering the host cell nucleus, TgIST se- 
questers STATI on specific loci and facilitates 
the formation of nonpermissive chromatin by 
recruiting the nucleosome remodeling deacet- 
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Tg, PIM1, Nuclei 


ylase (NuRD) repressor complex. The nascent 
transcription sites thereby remain silenced 
(30, 31). Given the IFN-y-inducible nature of the 
PIMI1 gene, we hypothesized that TgIST com- 
pletely shuts down PIM]7 gene expression, lead- 
ing to the loss of PIM1. Indeed, we observed 
rapid TgIST-dependent depletion of PIM1 upon 
infection with Tg (Fig. 5E), further precipitat- 
ing the loss of GBP1 phosphorylation and 
interaction with 14-3-3o (Fig. 5F). MS of GBP1 
interactomes of Tg-infected cells confirmed 
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mean + SD of n = 3 experiments. Graphs in (B) and (C) show mean + SD from 
n = 3 experiments, and graphs in (D) and (E) show mean + SD from n = 4 
experiments. Graphs in (G) show data from n = 3 replicates. *P < 0.05, **P < 
0.01, ***P < 0.001, and ****P < 0.0001 in (A) from two-way ANOVA comparing 
all conditions with THP-1 WT cells and in (H) from nested t test comparing 
infected with uninfected cells after adjustment for multiple comparisons; ns is 
not significant. For gel source data, see data S5. 


decreased interaction of GBP1 with 14-3-30 
and showed increased interaction with host- 
derived actin and parasite-derived TgROP5 
compared with uninfected cells (Fig. 5G and 
data S4). These experiments show that the 
activity of GBP1 is restrained in uninfected cells 
but can be rapidly triggered upon infection by a 
pathogen that interferes with IFN-y signaling. 
Supporting our hypothesis of bystander cell 
protection, we found that PIM1 was depleted 
specifically only in Tg WT-infected macrophages 
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but not in uninfected cells from that same exper- 
iment or in A7g/ST-infected cells (Fig. 5H). 
Finally, we wanted to investigate the effect 
of PIM1 and 14-3-30 on GBP! activity against 
Gram-negative bacteria such as Salmonella 
Typhimurium (STm), which is distinct from 
its activity against vacuolar pathogens such as 
Tg. For comparison, we assessed the cellular 
distribution of PIM1, 14-3-30, GBP1, and Ser’®- 
phosphorylated GBP1 in cells infected with Tg 
or STm by immunofluorescence imaging (fig. 
S16). In uninfected cells, all proteins showed 
a high degree of colocalization (Pearson’s co- 
efficient >0.88; fig. S16A). In Tg-infected cells, 
we observed the rapid recruitment of GBP1 to 
Tg vacuoles but no recruitment of PIM1, 14-3-30, 
or GBPIpS156 (GBP1 phosphorylated at Ser’®) 
(fig. S16B). This corroborated our finding that 
GBP1 must be dephosphorylated and released 
from sequestration before acquiring its para- 
siticidal, pathogen-proximal activity. We also 
observed decoration of the bacterial surface of 
cytosolic STm by GBP1 alongside recruitment of 
PIMI, 14-3-30, and GBP1pS156 (fig. SI6C), sug- 
gesting that GBPI’s high-affinity to LPS might 
override control by PIM1-mediated phospho- 
rylation and 14-3-30 sequestration. Supporting 
this finding, we did not observe degradation of 
PIMI or dephosphorylation of GBP1 upon STm 
infection (fig. S16D). Finally, in A74-3-3o cells, 
phosphorylated GBP1 was recruited to Tg vacu- 
oles, demonstrating that retention by 14-3-30 
in the cytosol, not phosphorylation by PIM1 
alone, ultimately controlled GBP1 pathogen- 
compartment targeting (fig. SIGE). 


Discussion 


Human GBP1 is amembrane-active large GTPase 
that can disrupt microbe-encapsulating mem- 
branes (5-15). However, GBP1 can also associ- 
ate with the Golgi apparatus (16, 20) and/or 
the plasma membrane (13, 16). How GBP1 ac- 
tivity is regulated in cells and how it is recruited 
to specific subcellular sites has remained elu- 
sive. In this work, we found that a large frac- 
tion of GBP1 is phosphorylated by the kinase 
PIMI at Ser’®, establishing that posttransla- 
tional modification of human GBP1 regulates 
its activity. Ser’°°-phosphorylated GBP1 was 
sequestered in the cytosol by 14-3-30, thereby 
preventing self-inflicted damage, in a mechanism 
reminiscent of how Rnd3 GTPase is inactivated 
(29). Thus, IFN-y-exposed macrophages are in 
an activated state but keep GBP1 on standby and 
its activity tightly controlled. In uninfected cells, 
ectopic expression and unrestrained action of 
GBP1 led to necrosis. Similarly, ectopic expres- 
sion of GBP1 in ovarian cancer cells reduces 
their viability by ~25% within 3 days (32). In 
infected cells, the Tg-induced block of IFN-y 
signaling-induced transcription depletes in- 
herently short-lived PIM1 (33) and springs a 
trap that unleashes GBP1 onto the pathogen- 
containing compartment. Pathogens from all 
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kingdoms of life have evolved effector proteins 
that interfere with the integrity of IFN-y signal- 
ing. Some examples include Mycobacterium 
tuberculosis blocking transcriptional transac- 
tivation downstream of STAT1 (34), Vaccinia 
virus phosphatase H1 targeting STATI for 
dephosphorylation (35), human cytomegalo- 
virus depleting JAK kinases (36), and hepatitis 
C virus NSS5A protein or Leishmania donovani 
both disrupting STAT1 phosphorylation (37, 38). 
The cytokine-induced kinase PIM1 may thus 
serve as a detector for a broad range of path- 
ogens that interfere with IFN-y signaling. Con- 
sistent with our work, PIM1 is a host factor that 
favors survival of Tg (39, 40). Furthermore, PIM1 
is a known GBP1-interacting protein in cancer 
cells (27) and is a prosurvival proto-oncogene 
that negatively regulates other proapoptotic pro- 
teins (24, 26, 41-44). Given GBP1’s ability to pro- 
mote programmed cell death, use of the specific 
GBP1:PIM1 interaction inhibitor (27) is a promis- 
ing way to chemically activate GBP1, with impli- 
cations for innate immunity and cancer therapy. 

Innate immune mechanisms that can cause 
cellular damage must be carefully regulated to 
ensure protection of bystander cells. The activ- 
ity of immune effectors is tightly controlled, 
either indirectly through inactivation of inhib- 
itory guard proteins or directly through activ- 
ation of upstream PRRs. The indirect guard 
mechanisms, originally described in plants 
(45), detect the interference of pathogens with 
key cellular processes, whereas the direct PRR 
mechanisms respond to conserved pathogen- 
associated molecular patterns. An example of 
such “guarding” in Mammalia that is concep- 
tually similar to the inactivation of GBP1 is the 
regulation of the Pyrin inflammasome, where- 
by the cytoskeleton-sensing kinases PKN1 and 
2 phosphorylate Pyrin and inactivate the pro- 
tein through the binding of 14-3-3 proteins. 
This system relays microbial perturbation of 
the cytoskeleton and thereby indirectly trig- 
gers immunity (46-49). Other recent examples 
include “self-guarding” of MORC3, which con- 
trols HSV-1 infection and secondary type I IFN 
responses (50), or viral disruption of protein 
synthesis that triggers apoptosis through the 
depletion of short-lived BCL-2 protein family 
members (57). Given the short half-life of PIM1, 
it is tempting to speculate that translation in- 
hibition might also unleash GBP1 from its re- 
striction. GBP1 directly activates immunity 
through its bona fide PRR function for cyto- 
solic LPS (7, 11, 13-15). As we found in this work, 
its other, parasiticidal function is controlled by 
PIM1. Thus, GBP1 combines both innate im- 
mune surveillance strategies and antimicro- 
bial activity in a single effector protein. The 
integration of an IFN-y-induced pathway with 
posttranslational control of innate immune 
defense could cover a broad range of patho- 
gens and enable cells to autonomously regu- 
late immunity depending on their infection 
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status. Similar cytokine-induced guard circuitries 
are likely to exist and thus ensure the protection 
of innocent bystander cells. 


Materials and methods 
Cell, parasite, and bacteria culture, infections, 
and treatments 


THP-1 cells (TIB202, ATCC) were maintained in 
Roswell Park Memorial Institute (RPMI) medium 
with GlutaMAX (35050061, Gibco) supplemented 
with 10% fetal bovine serum (FBS) (Sigma), at 
37°C in 5% COs. THP-1 cells were differentiated 
with 50 ng/ml phorbol 12-myristate 13-acetate 
(PMA) (P1585, Sigma) for 3 days and left to rest 
for 2 days by replacing the differentiation me- 
dium with complete medium without PMA. 
Human embryonic kidney 293T (HEK293T) 
cells (Cell Services, The Francis Crick Institute, 
London, UK), NIH3T3 cells (CRL-1658, ATCC), 
and human foreskin fibroblasts (HFFs) (SCRC- 
1041, ATCC) were cultured in Dulbecco’s modi- 
fied Eagle’s medium (DMEM) with GlutaMAX 
(Gibco) supplemented with 10% FBS (Sigma) 
at 37°C in 5% COs. Cells were not used beyond 
passage 25 and HFFs not beyond passage 15. 

Patient-derived colorectal tumor organoids 
(Organoid 389), originally derived from a T3N1 
poorly differentiated, mismatch repair-proficient 
adenocarcinoma of the sigmoid colon from a 
male subject (52), were grown and maintained 
in 50 ul of Matrigel domes (CLS356231, Corning) 
with 500 ul of human IntestiCult medium (06010, 
StemCell) in 24-well plates at 37°C and 5% CO. 
and passaged weekly. 

Tg were maintained by serial passage on 
HFF cells that were cultured in DMEM with 
GlutaMAX supplemented with 10% FBS at 37°C 
in 5% CO., Parasites were passaged the day be- 
fore infection to maintain high viability. Tg were 
prepared from freshly 25-gauge-syringe-lysed 
HFF cultures by centrifugation at 50g for 3 min; 
the cleared supernatant was transferred into a 
new tube and followed by subsequent centrif- 
ugation at 500g for 7 min and resuspension of 
the pelleted parasites into fresh complete me- 
dium. Parasites were added to the cells at a 
multiplicity of infection (MOD of 1. Infection 
was synchronized by centrifugation at 500g 
for 5 min. Two hours after infection, extracel- 
lular parasites were removed with three washes 
using warm phosphate-buffered saline (PBS) 
(806552, Sigma), and fresh complete medium 
was added before culturing at 37°C and 5% CO, 
for the required time. 

STm strain SL1344-GFP (GFP, green fluores- 
cent protein) (17) was maintained under am- 
picillin selection (11593027, Gibco) and grown 
on LB plus ampicillin agar plates. One day be- 
fore infection, bacteria from a single colony 
were inoculated and grown overnight at 37°C. 
The overnight culture was diluted 1:20 into LB 
plus 300 mM NaCl and grown with shaking 
in a closed container to an optical density at 
600 nm (OD¢po9) of 0.9. Bacteria were harvested 
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by centrifugation at 1000g for 5 min, washed 
with serum-free cell culture medium twice, and 
resuspended in 1 ml of plain medium. Cells 
were infected with STm at an MOI of 10, and 
infections were synchronized by centrifugation 
at 750g for 10 min. Infected cells were washed 
three times with warm PBS 30 min after in- 
fection, and fresh medium containing 10 pg/ml 
gentamicin (15750060, Gibco) was added. The 
bacterial MOI used for infections was confirmed 
by plating on LB agar plates. An overview of all 
cell lines, parasites, and bacteria strains can be 
found in table S2. All cell culture work was 
performed without the addition of antibiotics, 
and the cells were regularly tested for myco- 
plasma contamination by immunofluorescence 
and PCR. 

THP-1 cells were stimulated for 16 hours be- 
fore infection in complete medium with addi- 
tion of 50 international units (TU)/ml human 
IFN-y (285-IF, R&D Systems). Induction of 
protein expression in the Dox-inducible cells 
was performed with 200 ng/ml Dox (D9891, 
Sigma) overnight. To block translation for study- 
ing protein stability, cells were treated with 
50 ug/ml cycloheximide (CHX) (C7698, Sigma), 
and to block transcription, cells were treated 
with 6 ng/ml actinomycin D (A1410, Sigma). To 
block the GBP1:PIM1 interaction, cells were 
treated with 100 nM NSC756093 (SM1L1310, 
Sigma). For in vitro assays, NSC756093 was 
used at 1 uM. To chemically induce pyroptosis, 
cells were primed with 100 ng/ml LPS O111:B4 
(NC9673121, Enzo) and subsequently treated 
with 10 uM nigericin. Treatments to induce 
apoptosis were performed with 50 ng/ml tu- 
mor necrosis factor-a (TNFa) (210-TA, R&D 
Systems) and 10 ug/ml CHX. The pan-caspase 
inhibitors zVAD-fmk (25 uM, 60332, Cell Sig- 
naling Technologies) or qVD (20 uM, ab141421, 
Abcam) were used to block caspase activation 
for necroptosis induction or blocking apopto- 
sis or pyroptosis. To inhibit pyroptosis, cells 
were treated with 10 uM Mcc950 (inh-mcc, 
InvivoGen) or 50 uM disulfiram (DSF) (1224008, 
Sigma). To inhibit necroptosis, cells were treated 
with 50 uM GSK’872 (6492, Tocris) or 10 uM 
necrosulfonamide (NSA) (5025, Tocris). Tumor 
organoid cultures were treated with 100 IU/ml 
IFN-y or 1.66 uM NSC756093. 


Plasmid DNA and siRNA transfection 


THP-1 cells were transfected with 30 nM siRNAs 
2 days before infection (table S3). The transfec- 
tion mix was prepared as a 10x mix in OptiMEM 
containing the siRNA(s) and TransIT-X2 trans- 
fection reagent (MIR600x, Mirus) in a 1:2 ratio. 
All siRNAs were ON-TARGETplus pools from 
Dharmacon, with the negative control being 
ON-TARGETplus Non-targeting Pool (D-001810, 
Dharmacon). Transfection of DNA plasmids was 
performed using Lipofectamine 2000, follow- 
ing the manufacturer’s instructions (11668027, 
Invitrogen). 
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Lentiviral transduction 

Lentiviral packaging used HEK293T cells trans- 
fected with equimolar ratios of the required 
expression plasmid, pMD2.G, and psPAX2 
(Addgene, 12259 and 12260, respectively, both 
gifts from D. Trono) using Lipofectamine 2000 
in serum-free DMEM. Medium was replaced 
12 hours after transfection with fresh DMEM 
plus 10% FBS to rest the cells and again after 
12 hours with DMEM plus 10% FBS contain- 
ing 5 mM sodium butyrate (B5887, Sigma). The 
next day, medium was replaced with RPMI 
medium plus 10% FBS, and the cells were left 
to produce lentiviral particles for 1 day. Virus 
containing supernatant was filtered through 
a 0.43-um syringe filter and supplemented with 
8 uM polybrene (H9268, Sigma). The target 
cells were then resuspended in 500 ul of the 
virus-containing medium and “spinfected” for 
30 min at 1000g. One hour after infection, 1 ml 
of complete medium was added, and the cells 
were left to rest. The infection procedure was 
repeated a total of three times before selection 
with the appropriate selection reagent: 1 ug/ml 
puromycin (A1113802, Gibco), 200 g/ml zeocin 
(J67140, Alfa Aesar), and/or 15 ug/ml blasticidin 
S (15205, Sigma). Once untransduced con- 
trol cells had died in the selection medium, the 
newly created cells were verified for success- 
ful transduction by immunoblotting. 


Creation of new cell lines 
Inducible GBP1 cell lines 


THP-1AGBPI, THP-1AGBP1+Tet-GBP1 WT, THP- 
1AGBP!I+Tet-Flag-GBPI WT, and THP-1AGBPI+ 
Tet-mCH-GBPI WT have been published be- 
fore, and new cell lines were created identically 
using lentiviral transductions (11). To make 
cells expressing mutated GBP1, pLenti-Tet-GBP1 
plasmids (17), with and without tags, were mu- 
tated using site-directed mutagenesis and trans- 
duced into the THP-IAGBP/+Tet target cells using 
lentiviral transduction as described before. 


PIMI1 and 14-3-36 (SFN/YWHAS) CRISPR 
knockout cells 


Guide RNA (gRNA) sequences targeting the 
5’ and 3’ untranslated regions (UTRs) of the 
respective genes were designed using the MIT 
CRISPR design tool (crispr.mit.edu) (table S4). 
DNA oligonucleotides encoding the CRISPR 
RNA (crRNA) were annealed and cloned into 
BsmBI-digested (ERO451, Thermo Fisher Scien- 
tific) pLentiCRISPR-V2 backbone (53) using 
Quick Ligation kit (M2200, New England Bio- 
labs) and transduced into THP-1 WT or THP- 
1AGBPI cells using lentiviral particles. After 
selection with puromycin for 7 days, cells were 
subcloned by serial dilution into 96-well plates 
using preconditioned complete medium sup- 
plemented with nonessential amino acids 
(11140076, Gibco), penicillin and streptomycin, 
and GlutaMAX. Roughly 3 weeks after seeding, 
obtained clones were expanded and screened 
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for the absence of target protein expression by 
RT-qPCR. Clones that showed reduced or ab- 
sent target protein expression underwent sec- 
ondary screening by immunoblotting. For each 
of the cell lines, >4 single-cell clones with con- 
firmed absence of either PIM] or 14-3-30 pro- 
tein were pooled and cultured for 2 more weeks 
before undergoing final screening to confirm 
absence of the proteins. This resulted in the 
THP-1APIM17, APIM1/GBPI, Al14-3-30, and Al4- 
3-30/GBP1 cell lines. 


Reconstitution of PIM1 and 14-3-30 


CRISPR knockout cells of PIM1 and 14-3-36 
were transduced with the Dox-inducible sys- 
tem as previously described (11). To create PIM1- 
and 14-3-30-expressing Dox-inducible plasmids 
(pLenti-Tet-PZM7 or pLenti-Tet-14-3-30), the em- 
pty vector backbone was digested with BamHI, 
PIM1 and 14-3-30 open reading frames (ORFs) 
were amplified from cDNA obtained from IFN-y- 
primed THP-1 WT cells, and all fragments as- 
sembled with Gibson assembly. Further, the 
pLenti-Tet-PIM1 plasmid was mutated by site- 
directed mutagenesis to obtain a pLenti-Tet- 
PIM1""’ kinase-dead version, and all vectors 
transduced into the respective target cells using 
lentiviral transduction. 


RT-qPCR 


RNA was extracted from 0.25 x 10° cells using 
Trizol reagent (15596026, Invitrogen). GlycoBlue 
(6 ug/ml, AM9516, Invitrogen) was added during 
the isopropanol precipitation step. RNA quality 
was measured on a Nanodrop 2000 Spectro- 
photometer (Thermo Fisher Scientific). One mi- 
crogram of RNA was reverse transcribed using 
a high-capacity cDNA synthesis kit (4368813, 
Applied Biosystems). For qPCR, the PowerUP 
SYBR green kit (A25742, Applied Biosystems), 
20 ng cDNA in a 10-ul reaction, and primers at 
1 uM final concentration (sequences in table 
S5) were used. Primer specificity was ensured 
by designing primers to span exon-exon junc- 
tions, whenever possible, and for each primer 
pair, a melt curve was recorded and compared 
with in silico predicted melt curves from uMelt 
(54) and amplicon sizes were analyzed by agar- 
ose gel electrophoresis. Recorded cycle thresh- 
old (Ct) values were normalized to the recorded 
Ct of human HPRT17, and data were plotted as 
ACt (relative expression). To determine abso- 
lute expression, defined amounts of linearized 
plasmid standards were used as PCR template 
and the obtained Ct values used to calculate 
transcript numbers. 


MTT, XTT, and AnnV glow assays for cell survival 
and death 


To determine cell viability using XTT assays 
(Cell Proliferation Kit II, Roche), 25,000 cells were 
seeded per well in a 96-well plate or 10,000 cells 
in a 384-well plate and differentiated as de- 
scribed before. Cells were treated in RPMI 
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without phenol red (11835, Gibco). The detec- 
tion reagent was freshly prepared according to 
the manufacturer’s instructions and added be- 
fore the viability determination. Cells were then 
incubated at 37°C for 4 hours, and absorption 
was measured at 475 and 660 nm for correction. 

For organoid growth analysis, MTT reagent 
(M6494, Invitrogen) was added in a 1:10 dilu- 
tion to the plates, and the Matrigel domes were 
disrupted by pipetting. Plates were returned 
to a 37°C incubator for 30 min before adding 
lysis solution (4 mM HCI, 0.1% Triton-X100 in 
isopropanol), mixing by pipetting, and placing 
on a shaker in the dark for 10 min. Each well 
was thoroughly mixed, lysates were transferred 
in quadruplicates to a microplate, and absorb- 
ance was read at 550 nm on an iMark Micro- 
plate Absorbance Reader (Bio-Rad). Medium 
with Matrigel was used as a negative control. 

Apoptosis kinetics were analyzed using the 
RealTime-Glo Annexin V Apoptosis Assay (JA1001, 
Promega) according to the manufacturer’s in- 
structions. In brief, 50,000 cells were seeded 
per well of a white, tissue culture-treated 96-well 
plate, differentiated, pretreated, and infected. 
Simultaneously with infection, detection reagent 
was added. Luminescence was measured using a 
Fluostar Omega plate reader (BMG Labtech) 
that was preheated to 37°C. No-cell, medium- 
only controls were used for background correc- 
tion. To determine overall cell death levels, the 
area under the curve (AUC) was determined 
using Prism 8.4 (GraphPad, Inc.). 


Molecular cloning of vectors for transient expression 


For transient expression of Flag-GBP1 after trans- 
fection, the ORF was amplified by PCR using Q5 
polymerase and cloned into BamHI- and EcoRI- 
digested pcDNA3.1(+) vector using the Quick 
Ligation kit. pcDNA3-HA-14-3-38, y, €, C, and o 
were a gift from A. Ridley (University of Bristol, 
Bristol, UK) (29) and M. Yaffe (MIT, Boston, MA, 
USA) (55). To produce N-terminally HA-tagged 
14-3-3n and 14-3-30 in a pcDNA3 vector identical 
to the other constructs, the pceDNA3-HA back- 
bone was amplified by PCR. 14-3-3n was ampli- 
fied from pCS2-HA-14-3-3n (Addgene, 116887, a 
gift from F-Q, Li and K.-I. Takemaru) (56) and 
14-3-30 from pcDNA3-14-3-30-HA (a gift from 
A. Ridley), and finally, inserts and backbone 
were assembled by Gibson assembly. 


SDS-PAGE, immunoblotting, and gel staining 


For immunoblotting, 0.5 x 10° cells were seeded 
per well of a 48-well plate, differentiated with 
PMA, pretreated, and infected as described 
above. At the end of treatments, cells were 
washed with ice-cold PBS and lysed for 5 min 
on ice in 50 ul of RIPA buffer (150 mM NaCl, 
1% Nonidet P-40, 0.5% sodium deoxycholate, 
0.1% SDS, 25 mM Tris-HCl pH 7.4) supple- 
mented with protease inhibitors (Protease 
Inhibitor Cocktail set III, EDTA free, Merck) and 
PhosSTOP phosphatase inhibitors (4906845001, 
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Roche). Lysates were cleared by centrifugation 
at full speed for 15 min at 4°C. Subcellular frac- 
tionation was performed using the QProteome 
Cell Compartment Kit (37502, Qiagen) following 
the manufacturer’s instructions. Bicinchoninic 
acid (BCA) assay (Pierce BCA protein assay kit, 
23225, Thermo Fisher Scientific) was performed 
to determine protein concentrations. Ten micro- 
grams of total protein per sample were mixed 
with Laemmli buffer (1610737, BioRad) con- 
taining 5% dithiothreitol (DTT) and denatured 
at 95°C for 10 min and then run on Bis-Tris gels 
(Novex, Invitrogen) in MOPS running buffer. 
For immunoblots of culture supernatants, cells 
were treated in OptiMEM (1105802, Gibco). 
Proteins in the supernatants were precipitated 
with 4 Vol acetone (V800023, Sigma) overnight 
at -20°C and pelleted by centrifugation. Pellets 
were air dried for 10 min, resuspended in 40 pl 
of 2x Laemmli loading dye, denatured, and 
used for immunoblotting. 

After SDS-polyacrylamide gel electrophore- 
sis (SDS-PAGE), proteins were transferred onto 
nitrocellulose membranes using the iBlot trans- 
fer system (Invitrogen). Depending on the pri- 
mary antibody used, the membranes were 
blocked with either 5% bovine serum albumin 
(BSA) (A2058, Sigma) or 5% dry-milk (M7409, 
Sigma) in TBS-T (0.05% Tween-20) for at least 
1 hour at room temperature. Incubation with 
primary antibodies (table S6) was performed 
at 4°C overnight. Blots were developed by wash- 
ing with TBS-T, probed with 1:5000 diluted 
HRP-conjugated secondary antibodies, washed 
again and imaged on a ChemiDoc MP imaging 
system (BioRad) using enhanced chemilumines- 
cence (ECL) (Immobilon Western, WBKLS0500, 
Millipore). For quantification of protein band 
intensities, images were quantified using FIJI/ 
ImageJ and normalized to the actin loading 
control of each membrane. 

For silver staining of protein gels, after SDS- 
PAGE, the gels were washed in ddH,O and then 
silver stained following the manufacturer’s in- 
structions (Silver Stain Plus Kit, 1610449, BioRad). 
Similarly, washed gels were used for ProQ Di- 
amond staining following the manufacturer’s 
instructions (Pro-Q Diamond Phosphoprotein 
Gel Stain, P33300, Invitrogen). Stained gels 
were imaged on a ChemiDoc MP imaging sys- 
tem (BioRad). Similar to chemiluminescence 
quantification, protein bands from stained gels 
were quantified using FIJI ImageJ. Coomassie 
stain of SDS-PAGE gels was performed using 
InstantBlue Coomassie Protein Stain (ISBIL, 
Abcam). 


GBP1 oligomerization assay 


To assess GBP1 oligomerization, a protocol pre- 
viously published for ASC oligomerization (57) 
was adapted. In brief, cells were seeded and 
treated as described above but harvested by 
scraping in ice-cold PBS containing 2 mM EDTA. 
Next, cell pellets were washed with PBS plus 
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2 mM EDTA and resuspended in 200 ul of 
buffer A (20 mM HEPES-KOH pH 7.5, 10 mM 
KCl, 1.5 mM MgCl, 1 mM EDTA, 1 mM EGTA, 
320 mM sucrose) and lysed by passing the sus- 
pension through a 25-gauge needle. Lysates 
were cleared by centrifugation at 1800g and 
4°C for 15 min. From this, sample was kept as 
lysate input control. The remaining superna- 
tant was diluted 2x with buffer A, and then 
400 ul of CHAPS buffer (20 mM HEPES-KOH 
pH 7.5,5 mM MgCh, 0.5 mM EGTA, 0.1% CHAPS) 
was added. Precipitated GBP1 was pelleted by 
centrifugation at ’7500g for 30 min. Finally, pellets 
were resuspended in 30 tl of CHAPS buffer with 
4 mM of disuccinimidyl suberate (DSS) (21655, 
Thermo Fisher Scientific) and left to cross-link 
for 30 min at room temperature, pelleted again 
by centrifugation at 7500g for 30 min and then 
dissolved in 2x Laemmli loading dye. Samples 
were then analyzed by immunoblotting. 


Immunoprecipitations 


For immunoprecipitations 5x10° cells were 
seeded in six-well plates and differentiated, 
pretreated, and infected as described above. The 
cells were washed in ice-cold PBS and scraped 
from the plates. Whole-cell lysates were pre- 
pared by adding 500 ul of lysis buffer (1% Triton 
X-100, 20 mM Tris-HCl pH 8.0, 130 mM NaCl, 
1mM DTT, 10 mM sodium fluoride) with added 
protease and phosphatase inhibitor cocktails 
and incubating on ice for 15 min. Lysates were 
cleared by centrifugation at 4°C for 10 min at 
full speed. 

For Flag and HA immunoprecipitations, 
cleared lysates were added to Flag(M2)-agarose 
beads (A2220, Sigma) or Pierce Anti-HA Agarose 
beads (26182, Thermo Fisher Scientific) pre- 
equilibrated by washing three times with lysis 
buffer. Proteins were captured by incubation on 
a rotator overnight at 4°C. Beads were washed 
once with lysis buffer, three times with lysis 
buffer containing 260 mM NaCl, and again 
twice with lysis buffer. Proteins were eluted 
using 200 ng/ml 3xFlag peptide (F4799, Sigma) 
or 200 ng/ml HA peptide (12149, Sigma) in 
lysis buffer by incubation on an orbital shaker 
(1400 rpm) for 2 hours at room temperature. 

For immunoprecipitation of endogenous 
GBP1/GBP1pS156, 2 ug of antibodies against 
the proteins and 50 ul of protein G sepharose 
beads (ab193259, Abcam) were added per 1 ml 
of lysates (~2 mg of protein) and incubated on 
a rotator overnight at 4°C. Beads were washed 
with lysis buffer five times, and proteins were 
eluted by acidification with 50 ul of 0.2 M gly- 
cine pH 2.0 followed by an immediate wash 
with 50 ul of lysis buffer. Input, unbound, and 
elution fractions were kept during immunopre- 
cipitations and analyzed by immunoblotting. 


Semiquantitative immunoprecipitation 


To determine the affinity of GBP1 binding to 14- 
3-3 proteins, we adapted a previously published 
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protocol for semiquantitative coimmunopre- 
cipitation (58). For this, HA-tagged 14-3-3 pro- 
teins were produced by transfection of HEK293T 
cells and purified by HA immunoprecipitation. 
Proteins were eluted using HA peptide and 
concentrated using 10-kDa cutoff Amicon Ultra-15 
Centrifugal Filters (UFC901024, Merck Millipore), 
exchanging the buffer with binding buffer 
(25 mM HEPES pH 7.3, 100 mM NaCl, 0.01% 
Triton X-100, 5% Glycerol, 1 mM DTT). Phos- 
phorylated GBP1pS156 was purified from IFN-y- 
and Dox-treated THP-IAGBP/+Tet-Flag-GBP1 
WT cells using the GBPIpS156 antibody, and the 
protein was concentrated with 50-kDa cutoff 
Amicon Ultra-15 Centrifugal Filters (UFC905024, 
Merck Millipore), exchanging the buffer with 
binding buffer. Next, the purified and phos- 
phorylated Flag-GBP1 was bound to Flag(M2)- 
agarose beads (A2220, Sigma) by rotating at 
4°C overnight and washed with binding buf- 
fer, and the beads were blocked with 50 mM 
ethanolamine in PBS. Finally, 10 ul of pre- 
pared and washed GBP1 bait beads were incu- 
bated with different concentrations of purified 
14-3-3 proteins for 1 hour at 4°C and then 
washed five times with binding buffer. GBP1 
and bound 14-3-3 proteins were eluted by boil- 
ing the beads in 2x Laemmli loading dye. Finally, 
obtained samples were analyzed by immu- 
noblotting, and blots were quantified using 
FIJI/ImageJ. 


MS analysis of GBP1 phosphorylation 


In vitro kinase assay samples (with ATP and 
without ATP) were aliquoted in three tech- 
nical replicates per condition and diluted with 
8 Murea to 25 mM Tris-HCl, 4 M urea, 5 mM 
MgCl. Proteins were reduced with 5 mM 
Tris(2-carboxyethyl)phosphine-hydrochloride 
(TCEP) for 30 min at 37°C and alkylated with 
10 mM iodoacetamide (IAA) for 30 min at 
room temperature in the dark. Subsequently, 
protein samples were loaded onto a 30-kDa mo- 
lecular weight cutoff filter (MWCO) (Sartorius, 
VIVACON 500, VNO1H22) and washed three 
times with 50 mM ammonium bicarbonate 
(Ambic) by centrifugation at 14,000g. Proteins 
were digested with GluC (Promega, V1651) in 
50 mM Ambic at a protease/protein (w/w) 
ratio of 1:20 at 37°C overnight. Peptides were 
eluted by centrifugation at 14,000g, and the 
30-kDa MWCO filter was washed once with 
50 mM Ambic. Peptides were cleaned with C18 
UltraMicroSpin columns (The Nest Group) fol- 
lowing the manufacturer’s recommendation 
and dried in a SpeedVac concentrator (Thermo 
Fisher Scientific). 

Peptides were reconstituted in 3% aceto- 
nitrile (ACN)/0.1% formic acid (FA)/H,O, and 
0.5 ug per sample was subjected to liquid 
chromatography-MS/MS (LC-MS/MS) using 
an nLC1000 coupled to an Orbitrap Fusion 
Tribrid mass spectrometer (Thermo Fisher 
Scientific). Peptides were loaded onto a 30-cm 
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column (75-um ID, New Objective) with mobile 
phase buffer A containing 0.1% FA/H2O and 
separated by reverse-phase liquid chromatog- 
raphy at a ReproSil-Pur 120 A C18-AQ 1.- um 
stationary phase (Dr. Maisch HPLC GmbH). 
Peptides were eluted with mobile phase buffer 
B, which consists of 99.9% ACN/0.1% FA, start- 
ing with 5% B and increasing up to 50% B in 
100 min. LC-MS/MS measurements were ac- 
quired in data-dependent mode. MSI spectra 
were recorded from 200 to 2000 m/z with a 
resolution of 120,000 at 200 m/z in the Orbitrap 
analyzer. MS1 scans were triggered at an AGC 
target of 1 x 10° and maximum injection time 
in auto mode. Within a cycle time of 3 s, pep- 
tides were iteratively isolated with a 1.3-m/z 
isolation window and fragmented by ETD 
with automatic charge-dependent determined 
ETD parameters. MS2 scans were recorded in 
the Orbitrap analyzer upon reaching the AGC 
target of 5 x 10* with injection time in auto 
mode. Fragment ions were monitored with a 
resolution of 60,000 at 200 m/z. Fragmented 
peptides were dynamically excluded from 
further analysis for 30 s. Ions with charge state 
unassigned, 1, or >6 were excluded from 
fragmentation. 

Data-dependent acquisition (DDA) raw data 
were processed using MaxQuant (v1.5.6) (59). 
Spectra were searched against the sequences for 
recombinant GBP1 and PIM1, as well as the pro- 
teome sequences of Escherichia coli (strain B / 
BL21-DE3) (tax ID: 469008) and Trichoplusia ni 
(tax ID: 7111), all retrieved from UniProtKB (ac- 
cessed on 30.05.2021), and common contamina- 
tions. Default parameters of MaxQuant were 
applied while choosing the protease GluC, allow- 
ing four miss cleavages, setting the variable 
modifications to Oxidation (M), Acetyl (Protein 
N-term), and Phospho (STY), and fixed mod- 
ification to Carbamidomethyl (C). A false discov- 
ery rate (FDR) of 0.01 was controlled at peptide 
spectrum match, posttranslational modification 
(PTM) site, and protein level by MaxQuant. Phos- 
phosite localization was automatically calculated 
by PTM Score of Andromeda within the MaxQuant 
proteomic suite. Phosphosite data were further 
analyzed with Perseus (v1.6.10.50). Phosphosites 
were filtered for a minimum localization prob- 
ability of >0.75 and minimum three measure- 
ments in total across all conditions. A peaklist of 
ETD-activated MS/MS spectrum for GBPIpS156, 
preprocessed by Andromeda/MaxQuant, was 
exported from the MaxQuant viewer and anno- 
tated with Interactive Peptide Spectral Annota- 
tor (60), considering c-, ze-, and y-type fragment 
ions. The mass spectrometry proteomics data 
have been deposited to the ProteomeXchange 
Consortium via the PRIDE (67) partner reposi- 
tory with the dataset identifier PXDO30010. 


Identification of GBP1-interacting proteins by MS 


Six-well plates were seeded with 10’ THP- 
1AGBP!I+Tet-Flag-GBPI cells , and the cells were 
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differentiated, pretreated with IFN-y and Dox, 
and infected with Tg. Cells were washed in 
ice-cold PBS, scraped from the plates, pelleted 
by centrifugation, and washed in PBS. Whole- 
cell lysates were prepared by adding 500 ul of 
lysis buffer (1% Triton X-100, 20 mM Tris-HCl 
pH 8.0, 130 mM NaCl, 1 mM DTT, 10 mM so- 
dium fluoride) with added protease and phos- 
phatase inhibitor cocktails and incubation for 
15 min on ice. Lysates were cleared by high- 
speed centrifugation and used for Flag im- 
munoprecipitation. The samples were run on a 
12% Bis-Tris polyacrylamide gel until the run- 
ning front had entered the gel ~5 mm. Sam- 
ples were excised from the gel, destained with 
50% acetonitrile/50 mM ammonium bicarbo- 
nate, reduced with 10 mM DTT, and alkylated 
with 55 mM IAA. After alkylation, proteins 
were digested with 250 ng of trypsin overnight 
at 37°C, and the peptides were extracted in 
2% formic acid and 1% acetonitrile and speed 
vacuum dried. 

The peptides were reconstituted in 50 ul of 
0.1% trifluoroacetic acid (TFA) before analysis 
and loaded on a 50-cm EASY-Spray column 
(75-um inner diameter, 2-um particle size, 
Thermo Fisher Scientific) equipped with an 
integrated electrospray emitter. Reverse-phase 
LC was performed using the RSLC nano U3000 
(Thermo Fisher Scientific) with a binary buffer 
system at a flow rate of 275 nl/min. Buffer A 
was 0.1% FA and 5% DMSO, and buffer B was 
80% acetonitrile, 0.1% FA, and 5% DMSO. The 
samples were run on a linear gradient of buffer B 
(2 to 30%) in 95.5 min. The nano LC was cou- 
pled to an Orbitrap Fusion Lumos mass spec- 
trometer using an EASY-Spray nano source 
(Thermo Fisher Scientific). The Orbitrap Fusion 
Lumos was operated in DDA mode acquiring 
MS1 scan (R = 120,000) in the Orbitrap, fol- 
lowed by higher-energy collisional dissociation 
(HCD) MS2 scans in the Ion Trap. Top Speed 
acquisition algorithm with a cycle time of 3 s 
was used to determine the number of selected 
precursor ions for fragmentation. The dynamic 
exclusion was set at 30 s. For ion accumulation, 
the MSI target was set to 4 x 10° ions and the 
MS2 target to 2 x 10° ions. The maximum ion 
injection time used for MS1 scans was 50 ms 
and that for MS2 scans was 300 ms. The HCD 
normalized collision energy was set at 28, and 
the ability to inject ions for all available paral- 
lelizable time was set to “true.” 

Orbitrap .RAW files were analyzed by MaxQuant 
(version 1.6.0.13) using Andromeda for peptide 
search. For identification, peptide length was 
set to seven amino acids, and match between 
runs was enabled. Parent ion and tandem mass 
spectra were searched against UniprotKB Homo 
sapiens (tax ID: 9606) and Tg (tax ID: 1080348) 
databases. For the search, the enzyme spe- 
cificity was set to trypsin, with a maximum 
of two missed cleavages. The precursor mass 
tolerance was set to 20 parts per million (ppm) 
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for the initial search (used for mass recalibra- 
tion) and to 6 ppm for the main search. Product 
mass tolerance was set to 20 ppm. Fixed mod- 
ification was defined as cysteine carbamidome- 
thylation. Methionine oxidation and N-terminal 
protein acetylation were searched as variable 
modifications. The datasets were filtered on 
posterior error probability to achieve 1% FDR 
on the protein level. Quantification was per- 
formed with the LFQ algorithm in MaxQuant 
using three replicate measurements per ex- 
periment. Results were compared against the 
CRAPome database (62) to eliminate common 
contaminants of Flag immunoprecipitations 
(data S3 and S4). The MS proteomics data have 
been deposited to the ProteomeXchange Con- 
sortium via the PRIDE (67) partner repository 
with the dataset identifier PXD029463. 


In vitro kinase assay 


To determine in vitro kinase activity of PIM1 
(PV3503, Thermo Fisher Scientific), PIM2 (PV3649, 
Thermo Fisher Scientific), PIM3 (A30516, Thermo 
Fisher Scientific), and Akti/PKB (P2999, Thermo 
Fisher Scientific) against GBP1 (ab114960, Abcam), 
the recombinant proteins were mixed with 
100 uM ATP (PV3227, Thermo Fisher Scientific) 
in kinase buffer (50 mM Tris-HCl pH 7.4, 10 mM 
MgCl.) and allowed to incubate for 30 min at 
30°C. Reactions were stopped by addition of 
Laemmli buffer containing 5% DTT and de- 
naturing at 95°C for 5 min before analysis by 
immunoblotting or silver stain. 


GTPase assay 


To determine GTPase activity, 2 uM GBPI, in vitro 
phosphorylated GBP1, or purified GBP1:14-3-30 
complex was incubated with 1 mM of GTP in 
reaction buffer (40 mM Tris pH 7.5, 80 mM 
NaCl, 8 mM MgCl, 1 mM EDTA) at room 
temperature. The concentration of free phos- 
phate product was subsequently determined 
using Malachite Green Phosphate Assay Kit 
(MAK307, Sigma) by measuring absorbance at 
620 nm and interpolating a phosphate stan- 
dard curve. 


Antibody binding assay 


Enzyme-linked immunosorbent assay (ELISA) 
plates were coated with 1 uM phosphorylated 
or unphosphorylated GBP1 peptides in 100 ul of 
coating buffer (50 mM carbonate/bicarbonate 
PH 9.6) at 4°C overnight. Plates were washed 
with TBS-T and blocked with 1% BSA in TBS-T 
for 1 hour at room temperature. Plates were 
washed again, and GBP1pS156 antibody was 
added in different dilutions in blocking buffer 
and left to bind at 4°C overnight. Plates were 
washed and probed with horseradish peroxi- 
dase (HRP)-conjugated anti-rabbit secondary 
antibody and developed with TMB solution 
(N301, Thermo Fisher Scientific) for 10 min at 
room temperature. The reaction was quenched 
by the addition of 2 N H,SO,, and absorption was 
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quantified on a plate reader (Fluostar Omega, 
BMG Labtech). 


ITC 


ITC was performed on a MicroCal VP-ITC. For 
each titration, 14-3-30 and GBP1 or previously 
in vitro phosphorylated GBP1 were dialyzed 
into reaction buffer (50 mM HEPES pH 7.4, 
150 mM KC)). Titrations were performed at 
25°C with serial injections of 7-ul aliquots of 
14-3-36 (125 or 250 uM) into a solution of GBP1 
or GBP1pS156 (12.5 or 25 uM, respectively) in 
the calorimeter cell (volume 1.3 ml). Data were 
analyzed with ORIGIN software (MicroCal). 


Fixed immunofluorescence microscopy 


For imaging, 0.25 x 10° cells were seeded on 
coverslips in 24-well plates. After differentiation, 
treatments, and infection, cells were washed 
three times with warm PBS before fixation and 
then fixed with 4% methanol-free formalde- 
hyde (28906, Thermo Fisher Scientific) for 15 min 
at room temperature. Fixed cells were washed 
with PBS and kept at 4°C overnight. Fixed spe- 
cimens were permeabilized with PermQuench 
buffer [0.2% (w/v) BSA and 0.02% (w/v) saponin 
in PBS] for 30 min at room temperature and then 
stained with primary antibodies for 1 hour at 
room temperature. After three washes with PBS, 
cells were incubated with secondary antibody 
and 1 ug/ml Hoechst 33342 (H3570, Invitrogen) 
diluted in PermQuench buffer for 1 hour at 
room temperature. Coverslips were extensively 
washed with PBS and once with ddH,O and 
mounted using 5 ul of Mowiol. Specimens were 
imaged on a Leica SP5-inverted confocal mi- 
croscope at 100x magnification. 


High-throughput imaging 

For infection analysis, 5 x 10* THP-1s were 
seeded per well of a black-wall, clear-bottom 
96-well imaging plate (Thermo Fisher Scien- 
tific), differentiated, treated, and infected as 
described above. Following fixation with 4% 
methanol-free formaldehyde, specimens were 
permeabilized and stained with 1 ug/ml Hoechst 
33342 and 2 ug/ml CellMask Deep Red plasma 
membrane stain (C10046, Invitrogen) at room 
temperature for 1 hour. After staining, the spe- 
cimens were extensively washed with PBS and 
kept in PBS for imaging. 

For recruitment analysis, the cells were pre- 
pared as described above but were seeded on 
black-wall, glass-bottom 96-well imaging plates 
CG 1.0 (130-098-264, MACS Miltenyi Biotec). 
After fixation, cells were permeabilized and stained 
with primary antibody diluted in PermQuench 
buffer for 1 hour at room temperature. After 
three washes with PBS, cells were incubated 
with the appropriated secondary antibody and 
1 ug/ml Hoechst 33342 diluted in PermQuench 
buffer for 1 hour at room temperature. Then, 
the specimens were extensively washed with 
PBS and kept in PBS for imaging. 
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Imaging for infection analysis used a Cell 
Insight CX7 High-Content Screening Platform 
(Thermo Fisher Scientific) or a Celldiscoverer 7 
(Zeiss) using 10x magnification. For recruitment 
analysis, plates were imaged on an Opera Phenix 
High-Content Screening System (PerkinElmer) 
or a Celldiscoverer 7 (Zeiss) using 20x/40x mag- 
nification. After image acquisition, the images 
were fed into the Host Response to Microbe 
Analysis (HRMAn) analysis pipeline (63, 64). 


Vacuole and parasite breakage assay 


For quantification of Tg vacuole integrity, cells 
seeded in black-wall 96-well imaging plates 
were infected and treated as described above. 
One hour before fixation, 1 g/ml HCS CellMask 
Deep Red (H32721, Invitrogen) was added to 
the culture medium to load the cytosol of host 
cells with this fluorescent dye. After fixation 
and staining with Hoechst 33342, plates were 
imaged at 20x magnification on a Celldiscoverer 
7 (Zeiss). Fluorescence of the dye within detected 
Tg vacuoles was then analyzed using HRMAn. 
For analysis of parasite disruption, cells express- 
ing GFP,, fragment were infected with type II 
(Pru) TgAHpt+GFP,,.-expressing parasites. After 
the infection time course, cells were fixed, stained 
with Hoechst and anti-Tg-SAGI, imaged, and 
analyzed for GFP fluorescence using HRMAn, 
as described previously (13). 


Fluorescence recovery after photobleaching 


Quantification of GBP1 recruitment dynamics 
was performed using fluorescence recovery after 
photobleaching (FRAP). For this, 0.1 x 10° THP-1 
cells were seeded per well of an eight-well ibidi 
u-slide (80826, ibidi) differentiated, treated, and 
infected with Tg as described above. At 2 hours 
after infection, the specimens were moved to a 
Leica SP5-inverted confocal microscope. The 
experiment was performed using the LAS soft- 
ware FRAP-wizard with the following parame- 
ters: 256 by 256 pixels, 1400-Hz line frequency 
scan speed with bidirectional scan, 2 Airy unit 
(AU) pinhole size, and bleaching laser power at 
100%. After selection of the bleach area on Tg 
vacuoles, the experiment was performed with 
the following time course: 10 frames at 120 ms 
prebleach, 1 frame at 120 ms bleach, 100 frames 
at 120 ms postbleach I, 10 frames at 1 s post- 
bleach II, and 10 frames at 5 s postbleach III. 
After acquisition, data was double normalized 
(65) to correct for acquisition bleaching. Fi- 
nally, using curve fit, the half-time of recovery 
was determined as a measurement for the mo- 
bility of the molecules. 


Imaging quantifications 


Most quantifications of images were performed 
using HRMAn (63, 64). HRMAn was used for 
quantification of Tg growth, Golgi fragmen- 
tation, recruitment of proteins to pathogen 
vacuoles, and measurement of cell shapes, 
sizes and fluorescence properties. The Pearson’s 
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correlation coefficient for colocalization anal- 
ysis was computed using Coloc2 FIJI plugin 
(66). The average distance of GBP1 aggregates to 
the nucleus was determined using AggreCount 
(67). Golgi localization of GBP1 was quantified 
by determining the mean fluorescence intensity 
(MFI) of GBP1 in the GM130* area and divided 
by the MFI of the respective cell cytosol to nor- 
malize for uneven expression. 


Recombinant protein expression and purification 


GBP] and 14-3-36 ORFs were amplified from 
pLenti-Tet-GBP7 and pLenti-Tet-14-3-30 and 
ligated into NdeI- and BamHI-digested pET15d 
vector, pretagged N-terminally with an MBP- 
8xHis-3C or 6xHis-3C sequence using Gibson 
assembly to obtain pET15d-MBP-8xHis-3C-GBPI1 
and pET15d-6xHis-3C-14-3-3o0, respectively. Plas- 
mids were transformed into BL21 EF. coli. Bacte- 
ria from a single colony were picked and grown 
in a 10 ml of LB plus ampicillin (EU0400-D, 
Euromedex) starter culture overnight. Starter 
cultures were used to inoculate 12 liters of LB 
plus ampicillin cultures (1:1000 dilution), which 
were grown to ODggo0 ~ 0.9. Protein expres- 
sion was induced by the addition of 1 uM 
B-p-1-thiogalactopyranoside (IPTG) (EU0008-B, 
Euromedex). GBP1 was expressed for 16 hours 
at 18°C and 14-3-36 at 30°C for 4 hours while 
shaking at 150 rpm. 

Bacteria were harvested by centrifugation, 
washed with PBS, resuspended in 100 ml of 
lysis buffer (GBP1: 50 mM HEPES pH 7.8, 
30 mM imidazole, 150 mM KCl; 14-3-30: 50 mM 
NaH,PO, pH 8.0, 10 mM imidazole; 300 mM 
NaCl), and lysed by sonication on ice (12 min, 
1-s pulses with 1-s interruptions). The lysate 
was cleared by high-speed centrifugation at 
20,000g for 1 hour at 4°C and cleared lysate 
loaded onto a 5-ml Ni-NTA column (ab270529, 
Abcam). The column was washed with 20 ml 
of lysis buffer, and bound proteins were eluted 
with a 40-ml gradient from 30 to 500 mM im- 
idazole in lysis buffer. Target protein-containing 
fractions were pooled and dialyzed overnight 
against size exclusion buffer [size exclusion 
chromatography (SEC) buffer: 50 mM HEPES 
PH 7.8, 150 mM KCl] while also digesting with 
His-3C protease (Protein Expression and Puri- 
fication Core Facility, EMBL Heidelberg). His-3C 
protease and cleaved tags were removed using 
reverse immobilized metal affinity chromatog- 
raphy Ni-IMAC purification (635660, Takara), 
and proteins were concentrated using 50-kDa 
cutoff spin column concentrators (Amicon Ultra). 

The full-length PIM1 ORF was amplified from 
pLenti-Tet-PZM7 and ligated into BamHI- and 
HindIII-digested pLIB vector modified with a 
N-terminal MBP-TEV-SBP tag using Gibson 
assembly to obtain pLIB-MBP-TEV-SBP-P/M1. 
The assembled vector was used for Baculovirus 
generation in SF21 cells (SF21: 11497013, Gibco; 
SF-900 II SFM: 11497013, Gibco) as described 
previously (68, 69). Five milliliters of Baculovirus 
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was used for transfection of 500 ml of Hid (T: nz 
High Five Cells: B85502, Invitrogen; ExpressFive 
SFM, 10486025, Gibco) culture and incubated 
at 27°C for 72 hours. Hi5 insect cells were har- 
vested by centrifugation, washed with PBS, 
resuspended in lysis buffer (60 mM HEPES 
PH 8.0, 300 mM NaCl), and lysed by sonication 
at 4°C (12 min, 1-s pulses with 1-s interruptions). 
The lysate was cleared by high-speed centrif- 
ugation at 20,000g for 1 hour at 4°C. Cleared 
lysate was loaded onto preequilibrated amy- 
lose resin (E8021S, New England Biolabs) and 
left to bind for 4 hours at 4°C. The resin was 
washed with 10 ml of lysis buffer three times, 
and bound PIM1 was eluted with 30 ml of lysis 
buffer containing 25 mM maltose. Obtained 
MBP-TEV-SBP-PIMI was dialyzed against 50 mM 
NaH,PO, pH 8.0, 300 mM NaCl, and 30 mM 
imidazole buffer overnight while also digest- 
ing with His-TEV protease (Protein Expression 
and Purification Core Facility, EMBL Heidelberg). 
His-TEV protease and cleaved tags were removed 
using serial reverse His- and amylose-affinity 
purification and PIM1 protein concentrated 
using 30-kDa cutoff spin column concentrators 
(Amicon Ultra). Bulk-purified GBP1, PIM1, and 
14-3-30 were further purified by SEC with a 
HiLoad 16/600 Superdex 200 column (Cytiva) 
in SEC buffer. 


GBP1:14-3-30 complex formation 
and purification 


For in vitro reconstitution of the GBP1:14-3-30 
complex, purified GBPI, 14-3-30, and PIM1 were 
mixed in a 2:4:1 ratio in complex formation 
buffer (Tris-HCl pH 7.4, 150 mM KCl, 10 mM 
MgCl,) with addition of 1 mM ATP and incu- 
bated at 30°C for 1 hour. Reaction samples 
were cleared by centrifugation at 20,000g for 
5 min at 4°C and ran over a Superdex 200 In- 
crease 10/300 GL column (Cytiva Lifesciences) 
in SEC buffer. To further polish for structure 
analysis, complex-containing fractions were 
pooled, concentrated with a 100-kDa cutoff spin 
column concentrator (Amicon Ultra), and re- 
purified by SEC. To test the effect of 14-3-30 
binding on GBP1 dimerization, samples were 
additionally incubated with 1 mM GTP or with 
1mM GDP plus 10 mM AIF; (449628, Merck). 


Single-particle cryo-EM and structure modeling 


The GBP1:14-3-36 complex was prepared as 
described above. Fractions corresponding to 
a heterotrimeric complex were vitrified in 
R1.2/R1.3 UltrAuFoil grids with a 300 mesh 
(Q350AR13A, Electron Microscopy Sciences) 
that were glow discharged for 20 s at 25 mA 
and 0.3 bar using a Pelco EasyGlow device. The 
sample concentration was adjusted to 0.25 mg/ml, 
and 2-ul drops in SEC buffer were applied to 
either side of the grid. Excess sample was re- 
moved by blotting using a Vitribot MARK IV 
at 4°C and 100% humidity for 2 s with blotting 
force 10 and subsequently plunge-frozen in 
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liquid ethane. The vitrified particles were imaged 
using a FEI Titan Krios cryo-transmission 
electron microscope (TEM) at 300 kV at EMBL 
Heidelberg equipped with a GIF Quantum K2 
direct electron detector and a GIF quantum 
energy filter (Gatan). In total, 14,974 movies 
were collected in counting mode with seven 
exposures per hole at a nominal magnification 
of 215,000x with a pixel size of 0.638 A”. Each 
movie was exposed for 8 s, resulting in a total 
dose of 65.5 e/A? over 40 frames. 

All data-processing steps were performed 
using cryoSPARC v3.2.0 unless stated other- 
wise (70). Imported movies were subjected to 
motion correction using the cryoSPARC multi- 
patch motion correction followed by multipatch 
contrast transfer function (CTF) estimation. 
Initially, particles were identified by using au- 
tomated Gaussian picking for elliptical par- 
ticles with 50 by 120 A. Picked particles were 
extracted, fourfold binned, and subjected to 
extensive two-dimensional (2D) classification, 
which identified 4129 particles corresponding 
to the 14-3-30/GBP1 complex. The particles were 
used to train Topaz (77) and repick on all mi- 
crographs, yielding 420,768 particles. 2D clas- 
sification identified 29,150 particles. Ab initio 
modeling with three models was used to gen- 
erate preliminary volumes. Reextraction fol- 
lowed by a heterogeneous refinement using the 
previously determined initial 3D volumes re- 
sulted in a similar particle distribution. Class 3 
was used to retrain Topaz for automated par- 
ticle picking. Identified particles were extracted 
with fourfold binning. Subsequent 2D classifi- 
cation identified 138,809 particles, which were 
used to generate five new initial models. The 
particles were reextracted with full pixel size 
and subjected to heterogeneous refinement 
with the previously determined 3D volumes. 
Class 3 yielded a model with 40,112 particles. 
The corresponding particles were further clas- 
sified into three ab initio models. A class with 
18,717 particles was identified and subjected to 
nonuniform refinement, yielding a final cryo-EM 
map at 51A. 

Densities were generated in CryoSPARC and 
loaded into ChimeraX (version 0.93) (72), and 
previously reported crystal structures [Protein 
Data Bank (PDB) IDs 1IF5N for GBP1 and TYWT 
for 14-3-30 dimer) were fitted using the pro- 
gram’s own density fitting tools. The final model 
was deposited in the Electron Microscopy Data 
Bank (EMDB) with accession number EMD-18149, 
and the rigid body docked model was deposited 
in the PDB with accession number 8Q4L. 


Evolutionary analysis of GBP genes 


Position-Specific Iterated-Basic Local Alignment 
Search Tool (PSI-BLAST) (73) searches using 
full-length GBP1 as query of the RefSeq Protein 
sequence database identified >3300 GBP1-like 
proteins from >500 species. Protein sequences 
were aligned with MAFFT (74), and incomplete 
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data were removed with MaxAlign v1.1 (75). The 
closest homolog of human GBP1 could be iden- 
tified based on sequence homology in 484 spe- 
cies, and the respective sequences were analyzed 
for the presence of the corresponding phospho- 
serine and PIM1 recognition motif. 


In silico analysis, data handling, and statistics 


Data were plotted using Prism 8.4.0 (GraphPad, 
Inc.) and presented as means of multiple exper- 
iments with error bars as standard deviation 
(SD), unless stated otherwise. The significance 
of results was determined by nonparametric 
one-way analysis of variance (ANOVA), unpaired 
Student's ¢ test, or two-way ANOVA as indicated 
in the figure legends. Benjamini, Krieger, and 
Yekutieli FDR (Q = 5%)-based correction for 
multiple comparisons as implemented in Prism 
was used when making more than three com- 
parisons. Structures of GBP1 were obtained 
from the PDB database under PDB IDs 1F5N 
and 1DG3 (76, 77), and images were rendered 
with MacPymol v.1.74, which was also used 
for B-factor analysis. Molecular graphics and 
analyses were performed with UCSF ChimeraX. 
GBP1 phosphorylation-site data were obtained 
from www.phosphosite.org (accessed 4 January 
2019). 14-3-3 binding site identification used 14-3- 
3-Pred (78). Kinase predictions were performed 
using NetPhos Server3.1 (79) and GPS5.0 (80), and 
recognition sequence phosphorylation-likelihood 
values were obtained from PhosphoNET: Kinexus 
(http://www.phosphonet.ca/). All open-source 
KNIME workflows used in this work can be found 
at https://github.com/HRMAn-Org/HRMAn and 
on the homepage https://www.hrman.org. 
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INTRODUCTION: Vaccination has greatly reduced 
the disease burden of SARS-CoV-2. However, 
since late 2020, variants have emerged that are 
able to escape immunity from vaccination and 
previous infections, including B.1.351 (Beta), P.1 
(Gamma), B.1.617.2 (Delta), and B.1.1.529 (Omi- 
cron and its descendants). In combination with 
vaccination, infections with different variants 
form the basis of current population immunity 
against SARS-CoV-2. 


RATIONALE: Understanding the antigenic rela- 
tionships among SARS-CoV-2 variants, the sub- 
stitutions that cause antigenic change, and how 
the immune response is shaped by previous in- 
fections, is crucial for understanding the evolu- 
tion of the virus, determining whether new 
variants avoid neutralization from antibodies 


induced by current vaccines, and evaluating the 
need for vaccine updates. 

To this end, we analyzed patterns of neutral- 
ization and cross-reactivity among a panel of 21 
SARS-CoV-2 variants and 15 groups of human 
sera obtained from individuals after primary 
infection with one of 10 different variants or after 
D614G or B.1.351 vaccination. First, we sought 
to understand patterns of cross-reactivity and 
response breadth in postvaccination responses 
4 weeks and >3 months after second or third vac- 
cine doses. Then, we used antigenic cartography 
to visualize antigenic relationships between 21 
SARS-CoV-2 variants and experimentally test point 
mutations to investigate the drivers of the antigenic 
changes observed in the antigenic map. Lastly, 
we investigated how serological reactivity post- 
infection relates to the primary-exposure variant. 


Serological responses following exposure 
to different SARS-CoV-2 variants. (Top) 
Antigenic map showing antigenic relatedness 
of SARS-CoV-2 variants. Variants are grouped 
by amino acid substitutions present at 
positions 417, 452, 484, and 501, with an 
additional grouping for the six Omicron 
variants. (Bottom) Average effect size of 
single substitutions at different RBD positions 
on sera from individuals with different 
primary exposures, colored by the amino 
acid present at the respective RBD position 
in the eliciting variant. 
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RESULTS: Quantifying changes in cross-react Shes 
and response breadth after vaccination, o 
sults show the largest increase between 4 weeks 
and >3 months after a second dose. In particular, 
we found that the main short-term effect of the 
third vaccination was to boost the magnitude of 
a response that had already become more cross- 
reactive rather than to generate significant addi- 
tional breadth of cross-reactivity. 

Using antigenic cartography, we inferred and 
subsequently experimentally tested our inference 
that antigenic differences among pre-Omicron 
variants are primarily caused by substitutions at 
spike-protein positions 417, 452, 484, and 501 
(see figure, top). The experimental effect of these 
substitutions was largely consistent with those 
inferred from the map. 

We also found that sensitivity to these sub- 
stitutions varied greatly between individuals 
infected with different variants. These differ- 
ences are consistent with substantial changes 
in immunodominance of different spike re- 
gions, depending on the variant an individual 
was first exposed to and the amino acid present 
at these positions in the eliciting variant (see 
figure, bottom). For example, whereas sera 
of individuals exposed to D614G, B.1.1.7, and 
B.1.617.2 variants are sensitive to changes at 
position 484, sera of individuals exposed to 
B.1.351 and P.1 are not. 


CONCLUSION: Our results provide a comprehen- 
sive analysis of the antigenic variation between 
SARS-CoV-2 variants and the development of 
the immune response after infection or vac- 
cination. The large antigenic effect of a small 
number of substitutions in the receptor-binding 
domain (RBD) of SARS-CoV-2 is similar to 
the pattern observed for seasonal influenza 
viruses, for which major antigenic changes are 
often associated with single or double substi- 
tutions. These substitutions in the SARS-CoV-2 
RBD not only allow the virus to escape from 
preexisting immunity but also influence the 
regions of the spike protein that the immune 
response targets. Depending on their infec- 
tion history, different individuals can thus be 
sensitive to substitutions in different regions 
of the spike protein. As individuals increas- 
ingly experience multiple infections, choosing 
vaccine immunogens on the basis of immuno- 
dominance considerations may be an impor- 
tant aspect in ensuring high vaccine efficacy 
across populations with different patterns of 
preexisting immunity. 
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During the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic, multiple 
variants escaping preexisting immunity emerged, causing reinfections of previously exposed individuals. 
Here, we used antigenic cartography to analyze patterns of cross-reactivity among 21 variants and 

15 groups of human sera obtained after primary infection with 10 different variants or after 
messenger RNA (mRNA)-1273 or mRNA-1273.351 vaccination. We found antigenic differences among 
pre-Omicron variants caused by substitutions at spike-protein positions 417, 452, 484, and 501. 
Quantifying changes in response breadth over time and with additional vaccine doses, our results 
show the largest increase between 4 weeks and >3 months after a second dose. We found changes 

in immunodominance of different spike regions, depending on the variant an individual was first 
exposed to, with implications for variant risk assessment and vaccine-strain selection. 


ince the beginning of the severe acute res- 

piratory syndrome coronavirus 2 (SARS- 

CoV-2) pandemic, the virus has caused 

more than 766 million cases and 6.9 mil- 

lion deaths (7). During the first year of 
the pandemic, circulation was dominated by 
the B.1 variant, characterized by the D614G 
substitution in the spike protein, which im- 
parted increased infectivity and transmissi- 
bility in vitro and in animal models (2, 3) but 
did not escape serum neutralization (4). Since 
then, multiple variants have circulated widely, 
with five of them—B.1.1.7 (Alpha), B.1.351 (Beta), 
P.1 (Gamma), B.1.617.2 (Delta), and B.1.1.529 (Om- 
icron and descendant sublineages)—categorized 
as variants of concern by the World Health Or- 
ganization (WHO) on the basis of evidence of 
higher transmissibility, increased virulence, 
and/or reduced effectiveness of vaccines, ther- 
apeutics, or diagnostics (5). Sublineages arising 
within the Omicron lineage show considerable 


antigenic diversity, often being as antigenically 
different from the early Omicron variant (Om- 
icron BA) as BA.1 is from the B.1-like viruses (6). 

Before the emergence of the Omicron var- 
iants, several variants circulating widely in 2021 
were antigenically distinct from the B.1-like 
2020 viruses, with B.1.351 escaping neutraliza- 
tion by convalescent and postvaccination sera 
the most (7, 8). Moreover, sera from individ- 
uals first infected with B.1.351 and P.1 failed 
to readily neutralize B.1.617.2, and vice versa, 
although B.1.617.2 did not show strong escape 
from convalescent sera after infection with 
prototype-like variants (9, 10). All variants in 
the Omicron lineage show substantial escape 
from postvaccination and convalescent sera 
after infection with the pre-Omicron variants 
(11, 12). 

Understanding the antigenic relationships 
among SARS-CoV-2 variants and the substitu- 
tions that cause antigenic change is required 


for evaluating the need for vaccine updates 
and predicting whether new variants avoid 
neutralization from antibodies induced by 
current vaccines. However, with an increasing 
number of variants, understanding antigenic 
relationships through neutralization titer data 
becomes more complex. Antigenic cartogra- 
phy (73) is a tool originally developed for the 
analysis of human seasonal influenza virus 
antigenic data and has since been used in the 
analysis of antigenic variation in other patho- 
gens, including avian, equine, and swine influ- 
enza viruses (74-18); flaviviruses (19), including 
dengue viruses (20, 27); lyssaviruses (22); and 
foot-and-mouth disease viruses (23). It provides 
a quantitative and visual summary of antigenic 
differences among large numbers of variants 
and is a core component of the biannual in- 
fluenza virus vaccine-strain selection process 
convened by the WHO. In this study, we used 
antigenic cartography to analyze patterns of 
cross-reactivity among a panel of 21 SARS- 
CoV-2 variants and 15 groups of human sera 
obtained from individuals after primary infec- 
tion with one of 10 different variants or after 
prototype or B.1.351 primary-series vaccination. 
We followed this with experimental testing of 
point mutations to investigate the drivers of the 
antigenic changes observed and how the effects 
of subsequent changes on serological reactivity 
relate to the primary-exposure variant. 


Results 
Variant reactivity by serum group 


We used 207 serum specimens collected from 
vaccinated or infected individuals (table S1) 
and titrated in a Food and Drug Administra- 
tion (FDA)-approved (FDA/CBER Master File 
026862) neutralization assay against lentiviral 
pseudotypes encoding the spike protein of 21 
SARS-CoV-2 variants (fig. $1). Titrations in- 
cluded a panel of 15 pre-Omicron variants and 
6 Omicron variants. Before the collection of 
each serum sample, individuals had reported 
no known previous SARS-CoV-2 infections or 
vaccination (table S1). The infecting variant 
was determined by using whole-genome se- 
quencing. The serum specimens are from in- 
dividuals with infections with D614G (n = 15), 
B.1.1.7 (Alpha, 2 = 14), B.1.351 (Beta, n = 19), 


'Center for Pathogen Evolution, Department of Zoology, University of Cambridge, Cambridge CB2 3EJ, UK. “Institute of Virology, C| 
Universitat Berlin, Humboldt-Universitat zu Berlin, and Berlin Institute of Health, 10117 Berlin, Germany. “German Centre for Infecti 
Germany. “Department of Surgery, Duke University School of Medicine, Durham, NC, USA. "Duke Human Vaccine Institute, Duke U 
Global Pediatric Medicine, Department of Infectious Diseases, St. Jude Children’s Research Hospital, Memphis, TN, USA. “Hospital 
Department of Pathology, Duke University School of Medicine, Durham, NC, USA. °Centers for Disease Control and Prevention, At 


University of Wisconsin-Madison, Madison, WI, USA. “BC Centre for Disease Control, Vancouver, British Columbia, Canada. “Divisi 
Tokyo, Japan. “The Research Center for Global Viral Diseases, National Center for Global Health and Medicine Research Institute, 


Cellular and Molecular Medicine, Icahn School of Medicine at Mount Sinai, New York, 


Public Health, New York, NY, USA. *Department of Genetics and Genomic Sciences, Icahn School of Medicine a 
Children’s Research Hospital, Memphis, TN, USA. 
*Corresponding author. Email: sw463@cam.ac.uk (S.H.W.); david.montefiori@duke.edu (D.C.M.); dsmith@zoo.cam.ac.uk (D.J.S.) 


These authors contributed equally to this work. 


Wilks et al., Science 382, eadj0070 (2023) 6 October 2023 


harité — Universitatsmedizin Berlin, corporate member of Freie 
ion Research (DZIF), partner site Charité, 10117 Berlin, 

niversity School of Medicine, Durham, NC, USA. °Department of 
Nacional Daniel A. Carridn, Callao, Bellavista, Peru. 

lanta, GA, USA. ‘Erasmus Medical Center, Rotterdam, 


Netherlands. “School of Public Health, Universidad Peruana Cayetano Heredia, Lima, Peru. “Influenza Research Institute, Department of Pathobiological Science, School of Veterinary Medicine, 


on of Virology, Institute of Medical Science, University of Tokyo, 
Tokyo, Japan. Pandemic Preparedness, Infection and 


Advanced Research Center (UTOPIA), University of Tokyo, Tokyo, Japan. ‘Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, NY, USA. ‘®Department of Pathology, 
Y, USA. *°Moderna, Inc., Cambridge, MA, USA. “°Division of Infectious Diseases, Department of Medicine, 
cahn School of Medicine at Mount Sinai, New York, NY, USA. “The Global Health and Emerging Pathogen Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA. “Division of 
Child and Adolescent Health, Department of Pediatrics, Columbia University Vagelos College of Physicians and Surgeons, and Department of Population and Family Health, Mailman School of 
Mount Sinai, New York, NY, USA. “Departmen 


of Infectious Diseases, St. Jude 


1 of 15 


RESEARCH | RESEARCH ARTICLE 


P.1 (Gamma, 7 = 17), B.1.617.2 (Delta, 7 = 28), 
B.1.526+E484K (Iota, m = 6), B.1.637 (m = 3), 
C.37 (Lambda, 7 = 4), BA.1 (Omicron, n = 7), and 
BA.2 (Omicron, 7 = 1). We also included groups 
of sera collected from individuals twice vacci- 
nated with a vaccine comprising prototypic 
SARS-CoV-2 spike (Moderna mRNA-1273) at 
4 weeks (n = 32) or >3 months (n = 16) after a 


second dose, and sera after a third dose of the 
same vaccine at 4 weeks (n = 26) or >3 months 
(n = 8). Lastly, we included samples collected 
from individuals 4 weeks after a second dose 
of a B.1.351 spike vaccine (Moderna mRNA- 
1273.351) (n = 11). 

The geometric mean titer (GMT) and indi- 
vidual reactivity profiles for 183 sera after the 


exclusion of 24 outliers with titers indicative of 
an unreported previous infection are shown 
in Fig. 1 (see “Excluding outlier sera” section 
in Materials and methods, and fig. $2). Each 
serum group exhibited a distinct profile of neu- 
tralization against the tested variants and, as 
expected, homologous serum and virus pairs 
were among the most potent in each group 
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Fig. 1. Neutralization of lentivirus pseudotypes encoding different SARS-CoV-2 
spike proteins against different groups of human sera collected after 
vaccination or primary infection with different variants. Serum groups are 
split into (A) sera elicited by infection with different variants and (B) sera elicited by 
vaccination. Variants are ordered according to geometric mean titer (GMT) in D614G 
sera [(A), top left], and additional Omicron variants are ordered chronologically. 
Bold lines with empty circles show the GMTs calculated after estimated differences 
in individual response size were removed to mitigate biases in which not all sera 
from a group were titrated against a particular variant (as described in Materials 
and methods “Titer analyses” section). Fainter individual lines and solid points 
show individual serum titers. Points in the gray region at the bottom of the plots 
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show titers and GMTs that fell below the detection threshold of 20. Each panel is 
labeled according to the respective serum group and color coded as indicated on 
the x axis. Figure S3 shows titers split by sample source for the five serum groups 
in which samples came from a mixture of cohorts or agencies. Titer box plots, line 
plots showing the individual serum titers after accounting for individual effects, 
and titer fold differences relative to the homologous titer are shown in figs. S4, S5, 
and S6, respectively. Single-letter abbreviations for the amino acid residues are 

as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, lle; K, Lys; L, Leu; 
M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. In the 
mutants, other amino acids were substituted at certain locations; for example, 
D614G indicates that aspartic acid at position 614 was replaced with glycine. 
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(although some serum groups included only 
a small number of sera) (Fig. 1A). The B.1.351 
and P.1 serum groups both exhibited similar 
cross-neutralization of the B.1.351 and P.1 var- 
iants, suggesting a shared phenotype consistent 
with the closely matched receptor-binding do- 
main (RBD) substitutions in these two var- 
iants (fig. S1). The Omicron variants showed 
the greatest escape from sera after second and 
third mRNA-1273 vaccination. Notably, BA.4/ 
BA.5 titers were less than the other Omicron 
variants tested in the 4-weeks post-third mRNA- 
1273 vaccination sera but not in the samples 
taken directly before a third dose (>3 months 
after second mRNA-1273) or >3 months after 
a third dose. Comparable escape was seen for 
the Omicron and B.1.617.2 variants against sera 
obtained after two doses of mRNA-1273.351. 
Most BA.1 sera, and the BA.2 serum sample, 
showed no detectable reactivity to any pre- 
Omicron variants. However, for one BA.1 serum 
sample, in which titers were very high, some 
low levels of measurable reactivity were mea- 
surable for the B.1.351 and D614G variants 
(Fig. 1A). Overall, our findings relating to rela- 
tive antigenic escape of the different Omicron 
variants against different serum groups were 
consistent with findings in other studies that 
used both lentivirus pseudotype and live-virus 
neutralization assays, in which overlap was 
present (12) (fig. $7). 

Comparing postvaccination and _ postinfec- 
tion sera, less variation among individuals was 
seen in the mRNA-1273 and mRNA-1273.351 
vaccine sera than in the corresponding D614G 
and B.1.351 convalescent sera, both in terms of 
response magnitude and the pattern of reac- 
tivity seen against the variants (Fig. 1 and fig. 
85). This is likely related to the more stand- 
ardized dose (24) and nature of vaccination 
compared with infection (25-27) but might 
also be due to varying time intervals between 
infection and specimen collection in the con- 
valescent specimens compared with those in 
postvaccination specimens (28). 


Comparing vaccine response breadth 


We calculated the breadth of postinfection 
and postvaccination responses, controlling for 
differences in titer magnitude by focusing on 
changes in the pattern of fold drops for each 
variant relative to the homologous variant (see 
“Calculating fold-drop differences in vaccine 
sera” in Materials and methods). We found 
that titer fold drops relative to D614G in each 
of the mRNA-1273 vaccination serum groups 
had a pattern similar to those of D614G con- 
valescent sera, but the size of the fold drops 
was decreased, corresponding to an increased 
response breadth (Fig. 2A). Moreover, we found 
a temporal pattern of increasing response 
breadth (Fig. 2B), with the largest increase be- 
tween 4 weeks and >3 months after a second 
dose and, to a lesser extent, between 4 weeks 
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and >3 months after a third dose. In samples 
taken 4 weeks after a third vaccine dose, al- 
though titers were strongly boosted, breadth 
remained very similar to that measured in 
>3 months after two-dose mRNA-1273 sam- 
ples taken directly before the third dose. For the 
B.1.351 postinfection and postvaccination groups, 
although titers were higher in the mRNA-1273.351 
vaccination group compared with those of B.1.351 
convalescent sera, we did not find evidence for 
a significant difference in the breadth of cross- 
reactivity (fig. S8). 


Antigenic cartography 


To visualize and quantify how the different var- 
iants relate to each other antigenically, we used 
the titrations shown in Fig. 1 to construct an 
antigenic map, in which antigens and sera 
are positioned relative to each other such 
that the distance between them corresponds 
to the fold drop compared with the maximum 
serum titer (see “Antigenic cartography” in 
Materials and methods). To incorporate the 
information from the different post-mRNA- 
1273 vaccine serum groups but also account 
for how their increased cross-reactivity would 
otherwise underestimate the relative anti- 
genic differences between variants, we scaled 
distance estimates from these serum groups 
according to the fold-change difference esti- 
mates shown in Fig. 2B. Cross-validation re- 
sults indicated that the neutralization data 
could generally be well represented in two 
dimensions (fig. S9), as shown in Fig. 3A. 
Overall, the antigenic relationships depicted 
in this map were robust to assay noise and the 
exclusion of serum groups and variants (Ma- 
terials and methods, and figs. S10 to S21). The 
antigenic distinction between the B.1.617.2 
variant and the three B.1.617.2 variants with 
K417N was, however, found to be predomi- 
nantly driven by patterns of reactivity in the 
B.1.617.2 sera (figs. S16 and S18). 

The clearest deviation from a two-dimensional 
(2D) representation of the antigenic relation- 
ships was due to the BA.4/BA.5 variant titers, 
for which antigenic differences fit better in 
three dimensions (Fig. 3B). Compared with its 
position in 2D space, the variant occupies a po- 
sition that is more antigenically distinct from 
the other Omicron variants and closer to the 
B.1.617.2 sera. This positioning of BA.4/BA.5 
reflects the fold change between BA.1 and 
BA.4/BA.5 in the B.1.617.2 and BA.1 serum 
groups (fig. $25). BA.4/BA.5 showed a drop 
in titers compared with BA.1 in the BA.1 con- 
valescent sera but showed increased titers 
compared with BA.1 in the B.1.617.2 conva- 
lescent sera, possibly because BA.4/BA.5 and 
B.1.617.2 share the substitutions L452R and 
T478K. The relative antigenic distances esti- 
mated for BA.4/BA.5 were quite robust in re- 
lation to pre-Omicron serum groups, but the 
lower number of Omicron sera did lead to 
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some additional uncertainty in the antigenic 
distance relative to the BA.1 and BA.2 sera. 


Serological reactivity shown by antibody landscapes 


Antigenic relationships depicted in the anti- 
genic map provide a summary of how the reac- 
tivity of the different serum groups distributes 
among the variants. Figure 4 extends this 
visualization to antibody landscapes, where 
a surface in a third dimension represents an 
estimate of how the reactivity of each serum 
group and individual serum varies across an- 
tigenic space (see “Construction of the anti- 
body landscapes” in Materials and methods). 
The v-y plane is depicted by the antigenic map 
in Fig. 3A, and the height of the landscape over 
a particular point or variant represents the es- 
timated magnitude of serum reactivity in that 
antigenic region. Antibody landscapes there- 
fore give an indication of how serum reactivity 
distributes after exposure to different variants, 
show how the magnitudes of the responses 
compare, and predict expected levels of reac- 
tivity to variants that have not been titrated 
(fig. S26). 

Consistent with the titers in Fig. 1, the shape 
of the antibody landscapes is similar for the 
D614G and B.1.1.7 serum groups, with the high- 
est reactivity centered on the D614G and B.1.1.7 
variants. Similarly, the shape of the landscapes 
generated by the mRNA-1273.351, B.1.351, and 
P.1 sera is comparable, with highest reactivity 
centered on the B.1.351 and P.1 variants. The 
landscape of the B.1.617.2 sera shows a con- 
trasting topology, with the highest reactivity 
indicated against the B.1.617.2 variant and 
falling off toward other areas of the map. On 
average, the B.1.617.2 sera titers against non- 
B.1.617.2 variants were often lower than pre- 
dicted from the landscape. This suggests either 
that the B.1.617.2 sera could discriminate be- 
tween B.1.617.2 and other variants more than 
was seen in reverse in the non-B.1.617.2 serum 
groups or that our measurements of B.1.617.2 
serum antibodies against B.1.617.2 were biased 
toward higher values (fig. S26). Because our 
data showed a larger difference in reactivity 
between the B.1.617.2 and D614G variants in 
B.1.617.2 sera when compared with other sources 
(fig. S7), we speculate that the latter possibility 
may be the case. Separately, and in agreement 
with the titers, the landscapes show that the 
pre-Omicron sera investigated here would be 
expected to have markedly reduced reactivity 
against variants in the Omicron lineage (Fig. 4 
and fig. S26). However, we note again that titers 
against the BA.4/BA.5 variant are less reliably 
predicted when assuming antigenic relation- 
ships in only two dimensions. Landscapes of 
BA.1 first-infection sera show that the drop-off 
of titers to largely nondetectable levels against 
pre-Omicron variants is steeper than that seen 
in reverse for the pre-Omicron sera landscapes. 
This could be related to the small number of 
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Fig. 2. Comparison of fold drops with different variants in post-D614G 
infection and post-mRNA-1273 vaccination sera. (A) Comparison of different 
estimates of titer fold-drop responses against different variants. Solid points 
show the estimate for the mean fold drop compared with the titer for D614G, and 
solid lines represent the 95% HDI for this estimate. The points for D614G to the 
left of the plot represent the homologous virus against which fold change for 
other strains was compared and are therefore fixed at 1. Dotted lines and 
outline circles show estimates based on a model that assumes a shared overall 
pattern of fold drops but estimates “slope” differences in the rate of reactivity 
drop-off seen in the four serum groups (as described in Materials and 
methods, “Calculating fold-drop differences in vaccine sera”). To aid comparison, 
points and lines for each of the serum groups are slightly offset from each 
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other in the x axis to ensure that points do not directly overlap each other. 

(B) Estimates of fold-drop magnitudes for each mRNA-1273 serum group, relative 
to the fold-drops seen in the D614G convalescent serum group. Lines show 

the 95% HDI for each of the estimates, and the position on the x axis is proportional 
to the number of months since a second vaccine dose, assuming an average of 

6 months for sera in the >3-months post-2x mRNA-1273 and >3-months post-3x 
mRNA-1273 groups. (C) Antibody landscapes showing how estimates of the mean 
titer for each of the serum groups in (A) vary across antigenic space. (D) Antibody 
landscapes, as shown in (C), but with a fixed peak titer of 2560 against the 
D614G variant to highlight differences in titer drop-off slopes. Interactive versions 
of the landscapes shown in (C) and (D) are available at https://acorg.github.io/ 
mapping_SARS-CoV-2_antigenic_relationships_and_serological_responses. 
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Fig. 3. Antigenic map of SARS-CoV-2 variants and selected substitutions. 
(A) Antigenic map with variant names. (B) Antigenic map with variant positions 
in 3D representation and lines connecting to their respective positions in the 
2D map. (C) Antigenic map with variant names and substitutions annotated 
and grouped by amino acid present at spike positions 417, 452, 484, and 

501, with an additional grouping for the six Omicron variants. Variants are shown 
as circles, sera as squares. Variants with additional substitutions from a root 
variant are denoted by smaller circles in the color of their root variant. The x 
and y axes both represent antigenic distance, with one grid square corresponding 


to a twofold serum dilution in the neutralization assay. Therefore, two grid 
squares correspond to a fourfold dilution, three to an eightfold dilution, and so 
on. The x-y orientation of the map is free because only the relative distances 
between variants and sera are relevant. Triangular arrowheads at the edge 

of the bounding box point in the direction of the sera that would be shown 
outside of the plot limits. A nonzoomed version of this map is shown in 

fig. S24. Interactive versions of the maps shown in (A) and (B) are available 
at https://acorg.github.io/mapping_SARS-CoV-2_antigenic_relationships_ 
and_serological_responses. 


slopes of titer reduction across antigenic space, 


BA.1 sera or to inherent asymmetries in BA.1 
and pre-Omicron sera cross-reactivity. In gen- 
eral, the result is consistent with other results 
showing that BA.1 infections generate low titers 
to pre-Omicron variants (29, 30). 
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The landscapes for different mRNA-1273 
postvaccination sera confirm expectations for 
how cross-reactivity differs depending on the 
number of, and time since, vaccinations. These 
differences can largely be modeled by different 
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with reactivity that peaks in the same anti- 
genic region but decreases at differing rates 
(Fig. 4, K to M, and Fig. 2, B and C). The breadth 
of increase is clear at >3 months after a second 
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Fig. 4. Antibody landscapes for each 
serum group. Colored surfaces show 
the GMT antibody landscapes for 

the different serum groups; light-gray 
surfaces show the landscapes for 
individual sera. Gray impulses (vertical 
lines) show the height of the GMT for a 
specific variant, after accounting for 
individual effects as described in 
Materials and methods (which would 
otherwise bias the GMT for variants not 
titrated against all sera). The base x-y 
plane and plane colors correspond to 
the antigenic map shown in Fig. 3 

and reproduced in (P). The vertical 

z axis in each plot corresponds to the 
titer on the logs scale; each twofold 
increment is marked, starting from a 
titer of 20, one unit above the map E 
surface. The gray horizontal plane 
indicates the height of a titer of 50 as a 
reference for judging the landscapes 
against various estimates of neutralizing 
antibody correlates of protection. 
Additional visualizations of predicted 
versus fitted titers are shown in fig. S26. 
The number of sera included for the 
calculation of the landscapes are G 
(A) D614G sera (n = 13), (B) B.1.1.7 sera 
(n = 18), (€) B.1.351 sera (n = 15), 

(D) P.1 sera (n = 13), (E) B.1.617.2 sera 
(n = 21), (F) B.1.526+E484K sera (n = 4), 
(G) B.1.637 sera (n = 2), (H) C.37 sera 
(n = 2), (I) BA.1 sera (n = 4), (J) BA.2 sera 
(n = 1), (K) 4-weeks post-2x mRNA-1273 
sera (n = 30), (L) >3-months post-2x 
mRNA-1273 sera (n = 13), (M) 4-weeks 
post-3x mRNA-1273 sera (n = 26), 

(N) >3-months post-3x mRNA-1273 sera 
(n = 8), and (O) 4-weeks post-2x 
mRNA-1273.351 sera (n = 9). Interactive 
versions of the landscapes in each of 
the panels are available at https://acorg. 
github.io/mapping_SARS-CoV-2_ 
antigenic_relationships_and_serological_ 
responses. 
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mRNA-1273 vaccine dose and changes magni- 
tude but maintains similar breadth at 4 weeks 
and >3 months after a third mRNA-1273 vac- 
cine dose. 


Molecular basis of the antigenic map topology 


As shown in Fig. 3C, the locations of variants 
in the antigenic map point to amino acid sub- 
stitutions that are shared between pre-Omicron 
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variants with similar antigenic characteris- 
tics. For example, variants with substitutions 
at position 484 (E to K or Q) are positioned 
on the right of the map because of poorer neu- 
tralization by D614G and vaccine sera. Variants 
toward the top of the map [B.1.1.7, B.1.351, P.1, 
B.1.621 (Mu), and B.1.1.7+E484K] all have a 
substitution at position 501 (N to Y). Variants 
B.1.351 and P.1 additionally have substitutions 
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4 weeks post 2x mRNA-1273 
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4 weeks post 3x mRNA-1273 


>3 months post 3x mRNA-1273 


ie) 


4 weeks post 2x mRNA-1273.351 


at position 417, indicating that these changes 
are associated with increased reactivity to B.1.351 
and P.1 sera. Variants in the lower half of the 
map [B.1.617.2, B.1.429 (Epsilon), C.37, and 
B.1.617.1 (Kappa)] all have substitutions at po- 
sition 452 (L to R or Q). The Omicron variants, 
carrying at least 15 additional substitutions in 
the RBD, form a separate cluster in the lower 
right of the antigenic map. 
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To further investigate the molecular basis 
of these antigenic differences, we generated 
10 lentivirus pseudotypes with single substi- 
tutions at positions 417, 452, 484, and 501 in 
different RBD contexts and measured the ef- 
fect on reactivity to different serum groups 
and the subsequent positioning of variants in 
the antigenic map. As shown in Fig. 5, the 
antigenic effect of the different substitutions 
when introduced in isolation was generally 
consistent with that inferred from the antige- 
nic map of wild-type variants. Large antigenic 
effects were seen for substitutions at position 
484, which were associated with the right-left 
antigenic variation seen in the map, such that 
D614G+E484K showed greater escape than 
D614G+E484Q in D614G sera (Fig. 5, A and 
B, and fig. S27, A and B). Smaller but signif- 
icant effects were seen for substitutions at po- 
sition 417, which was shown to mediate some 
of the top-bottom map variation (Fig. 5B and 
fig. S28). Introduction of the N5O1Y substitu- 
tion alone into D614G did not mediate large 
antigenic changes in the map but did cause 
significantly increased reactivity to B.1.1.7, P.1, 
and B.1.351 sera (fig. S29) and generated a virus 
that was antigenically similar to B.1.1.7, which 
also contains only the substitution N5OTY in the 
RBD (Fig. 5A and fig. S1). 

Despite overall correspondence with map- 
based predictions, some results were unex- 
pected. In particular, although D614G+L452R 
showed decreased reactivity to D614G sera 
(fig. S27C), the D614G+L452R+E484Q mutant 
did not show any significant difference in reac- 
tivity compared with that of the D614G+E484Q 
mutant (fig. $27, B and D). Moreover, the B.1429+ 
K417N mutant showed increased reactivity to 
the D614G sera (fig. S28D), even though this 
represents a change away from sequence iden- 
tity with D614G. Because it has been shown 


that the presence of the K417N substitution 
alone in the absence of 501Y greatly reduces 
angiotensin-converting enzyme 2 (ACE2) affi- 
nity in some contexts (37), we speculate that the 
effect seen in B.1.429+K417N may be influenced 
by an artificial inflation of titers generally. 
Furthermore, the introduction of the A484K 
substitution into the BA.1 context allowed us 
to compare the effect of the alanine (A) sub- 
stitution present in the Omicron variants in 
contrast to the lysine (K) substitution seen in 
pre-Omicron antigenic escape variants, such 
as B.1.351. We found that the BA.1+A484K mu- 
tant had lower titers compared with the BA.1 
variant for sera taken 4 weeks after a second 
mRNA-1273 vaccine dose and also for B.1.617.2 
sera (fig. S30), raising the question as to why 
this alternative substitution has not been seen 
more frequently in Omicron variants in na- 
ture. In terms of movement in the antigenic 
map, in two dimensions, the BA.1+A484K mu- 
tant moves to a location to the top right of the 
BA.1 variant, bringing it closer to the B.1.351 sera 
but also consequently close to BA.4/BA.5 (fig. 
S31). However, antigenic relations with BA.4/ 
BA.5 are better described in 3D representation 
(Fig. 3B), in which the BA.4/BA.5 variant uses 
the third dimension and the BA.1+A484K mu- 
tant occupies a novel area of antigenic space dis- 
tinct from the other Omicron variants (Fig. 5C). 


Variation in immunodominance of different RBD 
sites between serum groups 


Overall, among the substitutions tested, not all 
serum groups were equally sensitive to changes 
at a given position, with some substitutions hav- 
ing a large effect on reactivity to certain serum 
groups but little to no effect in others. For exam- 
ple, although the D614G+E484K and D614G+ 
E484Q substitutions had large effects on re- 
activity to D614G sera, no significant effect on 
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Fig. 5. Antigenic maps including laboratory-made mutants with substitu- 
tions at positions 417, 452, 484, and 501. (A) Variants with substitutions 
N501Y, E484K, E484Q, and L452R+E484K in the background of D614G; D614G+ 
L452R is not shown because it was titrated against only D614G sera, so its 
position could not be determined. (B) Variants with the T/N417K substitution in 
the background of P.1 and B.1.351, respectively, and K417N in the background of 
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B.1.351 serum reactivity was found (fig. $27, A 
and B). Such findings are consistent with var- 
jation in immunodominance patterns and in 
the extent to which antibodies in different sera 
target different structural regions in the RBD. 

We sought additional evidence for immuno- 
dominance switches by analyzing results from 
the mutants alongside differences in serum 
reactivity between other pairs of variants that 
differed by single-amino acid substitutions in 
the RBD (Fig. 6A and figs. S32 and S33). 

For all pairs of variants with differences in 
the RBD only at position 484 from E to K or Q 
(Fig. 6A, rows 1 and 2), D614G sera consistently 
showed significantly reduced reactivity [by 
using the 95% highest-density interval (HDI) 
estimate of the pairwise difference in titers as 
a criterion throughout], whereas the B.1.351 
sera showed little to no significant change in 
reactivity. Across the other serum groups, sera 
from individuals infected with variants with 
the ancestral 484E (2x mRNA-1273, D614G, 
B.1.1.7, and B.1.617.2) were generally sensitive 
to differences at position 484, whereas sera 
from infections with variants with 484K (B.1.351, 
P.1, and B.1.526+E484K) were more resistant 
to these substitutions, although B.1.351 did 
show evidence for some smaller increases in 
titers in some cases. The BA.1+A484K mutant 
also reflected this pattern of sensitivities in the 
serum groups against which it was titrated, 
with significant titer decreases in mRNA-1273 
and B.1.617.2 sera (484E) and a smaller increase 
in reactivity to MRNA-1273.351 sera (484K) (fig. 
$30, S33). Of the four BA.1 sera (484A), the mean 
fold decrease of 2.1 against the BA.1+A484K mu- 
tant versus the BA.1 variant indicated that these 
sera are also somewhat sensitive to changes 
at the 484 position. However, this effect was 
far less than that seen for the D614G+E484K 
mutant on reactivity to post-D614G infection 


BA.1+A484K 


B.1.429 and B.1.617.1. (C) BA.1 with the substitution A484K. The map in (C) is 3D 
to highlight the antigenic differences between BA.1, BA.1+A484K, and BA.4/BA.5. 
The 2D version of (C) is shown in fig. S31. Arrows point from the antigenic 
position of the root virus to that of the laboratory-generated variant. Interactive 
versions of the maps shown in each panel are available at https://acorg.github. 
io/mapping_SARS-CoV-2_antigenic_relationships_and_serological_responses. 
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Fig. 6. Effect of pairwise amino acid differences on reactivity to different 
serum groups. This plot compares the average fold difference in titer between 
(A) different pairs of variants that differ by only a single amino acid difference 
in the RBD or (B) that do not differ by any amino acids in the RBD but 


variants compared 
additional amino di 


with additional ami 


differ. Filled circles indicate where pairs of variants had no 
fferences in the NTD region, often because one was generated 
as an artificial mutant. By contrast, open circles indicate pairs of variants 

no acid differences in the NTD region, in addition to the RBD 
amino acid difference listed. The estimate for B.1.617.2 sera fold differences 


differ in the NTD. Comparisons are grouped by serum group (panel columns) 
and corresponding RBD difference (panel rows). In each panel, the circle 
represents the estimate for the average fold difference in titer between variant A 
and variant B, as named on the left-hand side of the plot, and the lines extend 
to indicate 95% HDI for this estimate. The black dashed line marks a fold 
difference in titer of 1 (no difference), and the colored dashed line indicates the 
average fold difference between all pairs of variants with that substitution in the 
RBD. Points and lines are colored according to the amino acid in the variant 
homologous to that serum group, at the position in the RBD where the pair of 


between the B.1.617.2 and B.1.617.2 (AY.3)+E484Q variants [(A), third column, 
second row], which falls outside the plot, is -59.4 (95% HDI -117.1, -30.7). 
Details of how fold-difference estimates and HDls were calculated are 
described in Materials and methods. Figures S30 and S31 show the same 
results against an expanded set of pairwise amino acid differences. Interactive 
scatterplots comparing titers against each pair of variants for each serum 
group are available at https://acorg.github.io/mapping_SARS-CoV-2_antigenic_ 
relationships_and_serological_responses. 


and vaccination sera, suggesting that like the 
B.1.351 and P.1 serum responses, the BA.1 re- 
sponse also appeared to be directed away from 
the 484 position. 

Where variants differed only by N50TY (Fig. 6A, 
row 3), we also found that sensitivity was linked 
to the amino acid at position 501 in the infecting 
variant. The two serum groups resulting from 
infections with B.1.351 and B.1.1.7 (both with 
5O1Y) were sensitive to differences at position 
501, whereas the serum groups postprototype 
vaccination and post-D614G, B.1.526+E484K, 
and B.1.617.2 infection (variants with ancestral 
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501N) were typically not sensitive to the N501Y 
substitution. 

For K417N comparisons (Fig. 6A, row 4), there 
was evidence for equal or decreased titers in 
serum groups with exposure to variants with 
the ancestral K at position 417 (mRNA-1273, 
D614G, B.1.617.2, and B.1.1.7 serum groups) and 
a corresponding overall increase in titers to the 
serum group after infection with B.1.351 (417N). 
P.1 sera (417T) showed increased reactivity to 
the K417N substitution. The B.1.351 sera (417N) 
also showed increased reactivity associated 
with the K417T substitution, likely reflecting a 
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structural homology of the changes caused by 
K417T and K417N. 

In addition, where variants allowed for a 
comparison of the effect of the substitution 
1452R (Fig. 6A, row 5), we found evidence for 
most of the serum groups distinguishing be- 
tween variants that differed by this substitu- 
tion, with decreased (mRNA-1273, D614G, B.1.1.7, 
and P.1 sera) or increased (B.1.617.2 sera) titers, 
corresponding to the amino acid present at 
position 452 in the eliciting variant. One ex- 
ception was the B.1.351 sera, in which, unlike 
the other serum groups with the ancestral 452L, 
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the L452R substitution did not produce an over- 
all decrease in titers. 


Impact of changes in the NTD 


We also investigated the effects of substitutions 
in the N-terminal domain (NTD). Generally, the 
effects of single-amino acid differences in the 
RBD were consistent regardless of whether ad- 
ditional NTD differences were present or not 
(Fig. 6A). This was also reflected in pairwise 
comparisons of variants with no RBD differ- 
ences, in which we typically found no significant 
differences in titer reactivity (Fig. 6B). This in- 
cluded little evidence for a significant effect on 
serum reactivity of features, such as the NTD 
69 to 70 deletion in the B.1.1.7 variant (Fig. 6B, 
B.1.1.7 versus D614G+N501Y). However, for com- 
parators involving the B.1.351+N417K mutant, 
which has the substantial B.1.351 NTD changes 
that include the 241 to 243 deletion and R2461 
substitution, we did find evidence of differences 
compared with other viruses that had sequence 
identity in the RBD, such as the P.1+T417K and 
B.1.1.7+E484K mutants. Although these differ- 
ences may be associated with the effects of 
these NTD substitutions on titer reactivity [as 
has been found in other cases (32)], we found 
that in the comparisons shown in Fig. 6B, the 
B.1.351+N417K mutant also had lower titers 
against the B.1.351 sera themselves, despite se- 
quence identity of the NTD differences for 
these sera. This indicates that in our data, there 
was a general negative bias in titers measured 
against the B.1.351+N417K mutant, rather than 
antigenic effects of the substitutions within 
the NTD. Others have found that sensitivity to 
NTD substitutions can depend on the levels 
of ACE2 expression in the assay target cell 
line (33), which may relate to us finding little 
evidence overall for NTD effects in the cases 
that we compared. 


Discussion 


The antigenic analyses of SARS-CoV-2 variants 
presented here underscore the advantages of 
an integrated and extensible framework for 
understanding antigenic relationships and 
serum responses in SARS-CoV-2. The antigenic 
map and the antibody landscapes allow the 
comparison of serum responses by variant and 
also provide a quantitative measurement of 
both the response magnitude and breadth after 
different exposures. Antibody landscapes can 
also be applied to analyze how multiexposure 
serum responses distribute across antigenic 
space (34), which is important for ongoing 
studies seeking to compare the immunity built 
through different prospective vaccination regi- 
mens (35, 36). 

Using regression analyses and antibody land- 
scapes to analyze immune responses in vaccination 
sera taken at multiple time points after vacci- 
nation, we quantified and visualized changes in 
response cross-reactivity breadth, disentangling 


Wilks et al., Science 382, eadj0070 (2023) 


it from changes in cross-reactivity that are due 
to differences in response magnitude (Fig. 2). 
The similar estimates of response breadth just 
before the third vaccination, and from 4 weeks 
after the third vaccination, showed that the 
main short-term effect of the third vaccination 
was to boost the magnitude of a response that 
had already become more cross-reactive, rather 
than to generate additional breadth of cross- 
reactivity. This is consistent with studies into 
influenza vaccine responses, in which the pre- 
dominant effect of vaccination is boosting of 
preexisting patterns of prevaccination reac- 
tivity, which was independent of the vaccine 
variant used (34). It is possible that the third 
vaccine dose was responsible for the later ad- 
ditional increases in cross-reactivity in >3 months 
after third vaccine dose samples, although the 
effect size was small and not statistically sig- 
nificant. It is also possible that our observation 
of increasing response breadth over time (Fig. 
2B) would have continued to this point with- 
out a third vaccination. However, increases in 
breadth in the absence of a third dose would 
not have translated to increased protection 
without the effect of the third dose boosting 
antibody titers. 

Different variants and serum responses clus- 
tered on the antigenic map in a way that al- 
lowed the inference of candidates for the amino 
acid substitutions responsible for the main anti- 
genic changes. The impact of substitutions at 
position 484, 417, and 501 agree with other 
work based on deep mutational scanning (DMS) 
(37-39) and neutralization data from vesicular 
stomatitis virus pseudotyped particles (40, 41). 
The large antigenic effect of single substitu- 
tions such as E484K is reminiscent of influenza 
viruses, in which a large portion of the antige- 
nic difference is associated with only a single 
or double substitution located adjacent to the 
receptor binding site on the hemagglutinin 
protein (42-44). This is also observed for the 
determinants of SARS-CoV-2 antigenic evolu- 
tion described here, which are located close to 
the ACE2 binding site on the spike protein (figs. 
$34 and S35). 

Variation in immunodominance among dis- 
tinct antigenic regions is gaining recognition 
as an important aspect of the antigenic evolu- 
tion of pathogens and has also been seen in 
influenza, hepatitis C, and noroviruses (45-47). 
Conversely, the codominance of multiple sites 
in measles virus has been associated with re- 
duced potential for evasion from serological 
reactivity (48). Within SARS-CoV-2, our find- 
ings are consistent with studies showing that 
monoclonal antibodies tend to have different 
regional binding preferences when sourced 
from individuals exposed to different variants 
(10, 49-51). Conclusions suggesting an immuno- 
dominance switch for B.1.351 sera away from 
the 484 region also correspond with published 
data applying DMS to sera after D614G and 
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B.1.351 infection (39). Our results show how 
such immunodominance switches extend to 
vaccination and infections with different var- 
iants and how changes can be associated with 
the amino acid present in the different elicit- 
ing variants. These observations explain the 
wider patterns in the data—in which certain 
serum groups may distinguish clear antigenic 
differences between certain variants although 
others may not—and again underscore that 
antigenic differences between variants are 
not necessarily absolute measures but can be 
relative to the particular sera against which 
variants were titrated and the specific struc- 
tural regions that the particular sera tested pre- 
dominantly target. Immunodominance switches 
also highlight the potential limitations of visual- 
izing antigenic relationships solely in Euclidean 
space, as is done with antigenic maps, and em- 
phasize the need for further complementary 
approaches to their quantification and visu- 
alization, such as the explicit fold-drop com- 
parisons that we use here. 

We note two limitations in the analyses that 
we have presented. First, although the assay that 
we use is FDA certified, it measures neutraliza- 
tion with a lentivirus pseudotype neutraliza- 
tion assay, whose results may differ from those 
of live-virus neutralization assays. However, 
we generally find our fold-difference estimates 
to be within the ranges reported in other studies 
and, in agreement with others (12), we do not 
find an obvious bias compared with live-virus 
neutralization assay results (fig. S7). Second, 
we focus on patterns of serological reactivity 
and antigenic variation found up until the early 
Omicron variants. Beyond this point, it is very 
difficult to source sufficient primary-exposure 
human sera to study antigenic relationships in 
detail, and it is increasingly necessary to rely 
on animal-model sera. Consequently, it is crit- 
ical to assess whether the patterns of antigenic 
relatedness and changes in immunodominance 
found for human sera can be reliably measured 
in different animal models. 

As populations increasingly experience mul- 
tiple exposures, it will be important to investi- 
gate how antigenic differences in multiexposure 
serum responses compare with those inferred 
from primary-exposure sera, in particular with 
immunodominance patterns in mind. For ex- 
ample, findings that infection with Omicron 
BA.1 after a previous exposure to pre-Omicron 
variants predominately boosts antibodies tar- 
geting epitopes shared between pre-Omicron 
and Omicron variants highlight the value of 
understanding how different initial exposures 
dictate the structural regions that were initially 
targeted and how this interacts with a future 
exposure (52-55). Answers to these questions 
will better reveal the variants and substitu- 
tions to which different populations are most 
vulnerable and help anticipate which emerg- 
ing variants may be most at risk of evading 
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current immunity. The choice of vaccine im- 
munogens on the basis of immunodominance 
considerations may become as important as 
their antigenic characteristics. 


Materials and methods 
Specimens and study cohorts 


All samples, including those from vaccine re- 
cipients, were collected in clinical studies with 
information listed in table S1. Convalescent 
sera are assumed but not confirmed first in- 
fections, based on no previously recorded or 
self-reported SARS-CoV-2 vaccination or infec- 
tion, plus additional criteria in some cases, as 
also described in table S1. For convalescent 
samples, the infecting variant was determined 
using whole-genome sequencing. Institutional 
review board (IRB) approvals were obtained 
for each study site. All participants were given 
and signed the informed consent form. 

Vaccination samples came from a variety of 
sources, as also indicated in table S1 and de- 
scribed in more detail below: 


4 weeks post 2x mRNA-1273 


The majority (n = 28) of the 4-weeks post-2x 
mRNA-1273 samples were from the mRNA- 
1273 phase 1 study (NCT04283461) and were 
obtained from the Division of Microbiology and 
Infectious Diseases (DMID), National Insti- 
tute of Allergy and Infectious Diseases (NIAID) 
for the mRNA-1273 phase 1 study team, and 
Moderna. The phase 1 study protocols and re- 
sults are reported previously (57, 58). The phase 
1 trial tested the identical vaccine (mRNA- 
1273), dose (100 ug), and schedule as used in 
the Moderna phase 3 (NCT04470427); titra- 
tions against these samples were also reported 
in (59). A smaller subset of samples (7 = 4) were 
obtained from the Coronavirus Efficacy (COVE) 
phase 3 trial and were preselected as having 
high titers of neutralizing antibodies against 
D614G to enable detection of neutralization 
against the wider range of Omicron variants 
that they were measured against; some titra- 
tions against these samples also appear in (60). 
Finally, an additional 11 sera from the COVE 
phase 3 trial were titrated against D614G and 
the D614G+N501Y mutant, and an addition- 
al 26 sera were titrated against BA.1 and the 
BA.1+A484K mutant. These sera appear in 
the results shown in fig. S29 and S30, respec- 
tively, but are not included in the general anal- 
ysis due to lack of titrations against further 
variants. 


>3 months post 2x mRNA-1273 


All samples relate to sera previously described 
in (67) that were obtained before homologous 
mRNA-1273 boosting (50 pg) in 16 study par- 
ticipants who had received two inoculations of 
mRNA-1273 (100 wg) under emergency-use au- 
thorization. These participants were part of 
a subset analysis of high D614G responders in 


Wilks et al., Science 382, eadj0070 (2023) 


the NIAID heterologous boost study DMID 
21-0012. 


4 weeks post 3x mRNA-1273 


Sixteen samples relate to sera from the same 
individuals in the >3-months post 2x mRNA- 
1273 group, but obtained 29 days after homol- 
ogous MRNA-1273 boosting (50 ug), again as 
described and appearing in (67). An additional 
10 samples also relate to samples collected as 
part of the DMID 21-0012 and also relate to 
samples previously described in (60), which 
were also a subset of high D614G responders. 


>3 months post 3x mRNA-1273 


These samples were taken as part of the COVID-19 
Variant Immunologic Landscape (COVAIL) ran- 
domized clinical trial (NCT 05289037) and are 
from the subset of individuals who had received 
a previous mRNA-1273 vaccine regimen fol- 
lowed by a mRNA-1273 booster dose. Samples 
were taken directly prior to a fourth mRNA- 
1273 dose and, as for other vaccine groups, we 
subset to those with no previous record of a 
SARS-CoV-2 infection. 


4 weeks post 2x mMRNA-1273.351 


Samples were from COVID-19 naive participants 
who received two 100-.g doses of mRNA-1273.351 
on days 1 and 29 as part of a clinical trial (NCT 
04785144), experimental arm 2C. 

The use of the specimens for the research 
conducted in this manuscript is covered under 
IRB protocols Pro00093087 and Pro00105358. 

Where samples in the same serum group 
came from a mixture of cohorts or agencies, 
we did not find evidence for differences in the 
patterns of reactivity according to cohort or 
agency (fig. $3). 


Pseudotype virus assay 


Neutralization was measured with lentiviral 
particles pseudotyped with SARS-CoV-2 spike 
and containing a firefly luciferase (Luc) reporter 
gene for quantitative measurements of infection 
by relative luminescence units (RLU). The assay 
was performed in 293T/ACE2-MF provided by 
Michael Farzan (-MF) and Huihui Mu. The de- 
rivation of the cell line is described in (62). Pseu- 
doviruses were prepared, titrated, and used for 
measurements of neutralizing antibodies essen- 
tially as described previously (59). Briefly, an ex- 
pression plasmid-encoding codon-optimized 
fullength spike of the Wuhan-1 ancestral sequence 
(VRC7480) was provided by Barney Graham and 
Kizzmekia Corbett at the Vaccine Research Cen- 
ter, National Institutes of Health (USA). Muta- 
tions were introduced into VRC7480 either by 
site-directed mutagenesis using the QuikChange 
Lightning Site-Directed Mutagenesis Kit from 
Agilent Technologies (Catalog no. 210518) or 
were created with a spike gene synthesized by 
GenScript, using the spike sequence in VRC7480 
as template. All mutations were confirmed with 
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full-length spike-gene sequencing by Sanger Se- 
quencing, using Sequencher and SnapGene for 
sequence analyses. Pseudovirions were produced 
in HEK293T/17 cells (ATCC cat. no. CRL-11268) 
by transfection using Fugene 6 (Promega Cat no. 
E2692) and a combination of spike plasmid, 
lentiviral backbone plasmid (pCMV AR8.2), and 
firefly Luc reporter gene plasmid (pHR’ CMV 
Luc)36 in a 1:17:17 ratio in Opti- MEM (Life Tech- 
nologies) (CMV, cytomegalovirus). Transfection 
mixtures were added to preseeded HEK 293T/ 
17 cells in T-75 flasks containing 12 ml of growth 
medium and incubated for 16 to 20 hours at 
37°C. Medium was removed and 15 ml of fresh 
growth medium added. Pseudovirus-containing 
culture medium was collected after an additional 
2 days of incubation and clarified of cells by 
low-speed centrifugation and 0.45-um micron 
filtration. Median tissue culture infectious dose 
(TCID;9) assays were performed as described 
previously (59). 

For measurements of neutralization, a pre- 
titrated dose of virus was incubated with eight 
serial fivefold dilutions of serum samples (1:10 
or 1:20 starting dilution) in duplicate in a total 
volume of 150 ul for 1 hour at 37°C in 96-well 
flat-bottom culture plates. 293T/ACE2-MF cells 
were detached from T-75 culture flasks using 
TrypLE Select Enzyme solution, suspended in 
growth medium (100,000 cells/ml) and imme- 
diately added to all wells (10,000 cells in 100 yl 
of growth medium per well). One set of eight 
wells received cells and virus (virus control), 
and another set of eight wells received cells 
only (background control). After 66 to 72 hours 
of incubation, medium was removed by gentle 
aspiration, and 30 ul of Promega 1X lysis buffer 
was added to all wells. After a 10-min incuba- 
tion at room temperature, 100 ul of Bright-Glo 
luciferase reagent was added to all wells. After 
1 to 2 min, 110 ul of the cell lysate was trans- 
ferred to a black or white plate. Luminescence 
was measured using a GloMax Navigator lu- 
minometer (Promega). Serum samples were 
heat-inactivated for 30 min at 56°C prior to 
assay. Neutralization titers are the inhibitory 
dilution (ID) of serum samples at which RLUs 
were reduced by either 50% (IDs) or 80% 
(IDgo) compared with virus control wells after 
subtraction of background RLUs. The spike mu- 
tations of all variants used in this study are 
shown in table S2. 


Titer analyses 
Excluding outlier sera 


The serum samples used in this study were 
collected from individuals with no reported 
previous vaccination or infection preceding 
the exposure that was associated with their 
respective serum group, with further addi- 
tional criteria for some sample sources as de- 
scribed in table S1. However, as with most 
studies, it is possible that for some individ- 
uals, additional previous (even asymptomatic) 
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infections were not detected or reported. Based 
on this possibility, we sought to identify sera 
with patterns of reactivity that were less likely to 
be consistent with a true first exposure. As such, 
we applied selection criteria to all the sera to 
identify cases where a titer was measured that 
was higher than the serum-group-homologous- 
variant titer by more than twofold (for exam- 
ple, a serum from the B.1.617.2 group that had 
a titer against D614G that was more than two- 
fold higher than the titer measured against 
B.1.617.2). Subvariants of B.1.617.2 and BA.1 
(Fig. 3, smaller circles) were excluded from this 
analysis, as well as the variant P.1, which we 
noted tended to often have high titers gener- 
ally. For example, P.1 often has higher-than- 
homologous titers for sera from the B.1.351 
serum group, possibly due to assay-related biases 
inflating titers for the particular P.1 variant, as 
it has been observed in other assays using lenti- 
viral pseudotypes (9). B.1.429 was used as the 
homologous variant for sera from the B.1.637 
group, being the genetically most similar var- 
jiant titrated, while D614G was used for mRNA- 
1273 vaccination groups; and B.1.351 was used 
for the mRNA-1273.351 vaccination group. Fif- 
teen of the total of 207 sera had titers exceeding 
the twofold-higher-than-homologous criteria de- 
scribed above and are highlighted in red in the 
titer plot shown in fig. $2. 

As an additional step, we also excluded nine 
sera as likely second-infection responses based 
on visual inspection of patterns of reactivity 
that deviated strongly from the general trend 
of titer reactivity for a given serum group. These 
sera are additionally highlighted in fig. S2 in 
blue. In particular, this criteria applied to sera 
from the BA.1 serum group where we noted a 
clearly bimodal pattern of reactivity with three 
post-BA.1 infection sera that showed high titers 
against the D614G variant and four sera which 
did not. Based on the large antigenic differ- 
ence between D614G and BA.1 and the types 
of reactivity seen in more controlled animal- 
model BA.1 infections (63), we judged the four 
more-BA.1-specific sera to be likely first infec- 
tions and excluded the rest of the group as likely 
cases that had had a previous exposure to a pre- 
Omicron variant. 


Calculation of Geometric Mean Titers (GMT) 
and fold changes for nondetectable titers 


Calculation of the mean titers becomes non- 
trivial when experiments contain nondetectable 
titers or, in data science terminology, censored 
data. Although computation of means of data 
with censored values has been around for over 
60 years (64-66), it does not seem to be com- 
monly employed when analyzing data from 
neutralization assays. Correct estimation of the 
GMT becomes especially important if one wants 
to quantify the amount of escape by variants 
like the Omicron variants, with many nonde- 
tectable titers. Setting nondetectable titers to 
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half the value of an assay’s limit of detection is 
acommon method employed in virology, how- 
ever, it may overestimate the GMTs for such 
variants. Below we give a brief overview of our 
approach to dealing with such censored data, 
functions for which are included in the asso- 
ciated titertools package (67). 

The main step in estimation of the titers is 
the construction of a likelihood function that 
describes the probability of observing both the 
nondetectable and detectable titers. In the 
dataset used for this study, we deal with only 
left-censored data. We assume that log, titer 
values of a particular serum group are mod- 
eled by a normal density function, f(a; u, 0), 
with unknown mean p and standard deviation 
(SD) o, and we let y;(7 = 1,...,M) denote de- 
tectable titers and 7;(¢ = 1,...,N) denote the 
nondetectable titers. Then the log likelihood 
function, LL, is given by 


N 
LL(u, 6) = So logf" f(a: 1,0) + 
= 


M 
Slog s (vi: 4,6) 
i=1 


In this expression, the term involving the in- 
tegral is the likelihood of observing the censored 
titers, whereas the other term is the likelihood 
of observing the noncensored titers. This like- 
lihood function then can either be used in a 
maximum-likelihood estimation pipeline or, 
if priors are chosen for p and o, in a Bayesian 
posterior-calculation pipeline from which the 
value of 1 can be estimated in various ways. For 
the calculations in this paper, confidence in- 
tervals were calculated using the highest den- 
sity interval (HDI) method of calculation from 
10,000 samples. 

When calculating the fold changes in titers, 
the same approach to nondetectable titers was 
taken, with, for example, a change from 40 to 
<20 equating to a change of <-1 on the log, 
scale, for which the likelihood function would 
be the integral of the normal density function 
from - to -1. For estimating fold changes, we 
used the function “log2diff’ in the titertools 
package, with parameter “sigma = 0.87,” based 
on our estimates of the magnitude of variation 
in paired measurement repeats (fig. S10). 


Titers after accounting for estimated 
individual effects 


The data used in this paper contain sera col- 
lected from different individuals infected with 
the same variants. Even when these titers are 
grouped according to the infecting variant, one 
can see quite large variations in titer magni- 
tudes, whereas the qualitative appearance of 
the titer curve is similar (Fig. 1). Some of this is 
due to individual variation in the overall re- 
activity of sera collected from different indi- 
viduals, for example, related to the magnitude 


6 October 2023 


of the serological response. We model this be- 
havior in the following way: Assume that ¢; is 
the log, titer measurement of antigen 7 against 
serum j from the serum group J. Then 


ty = Sig +7) + eg(i =1,....n andj = 1,...m) 


where s;; is the average log, titer of antigen 7 
against the serum group J, 7; is the serum re- 
activity bias of the serum j, and e¥ is the in- 
dependently and normally distributed log-titer 
noise. 7 is the number of antigens, and 7m is 
the number of sera. For each serum group, 
parameters s;; and 7; are then found to max- 
imize the sum of the likelihood of each mea- 
sured log titer given the observed log titer, tj. 
For the fitting procedure, we used the function 
“estimate_sr_effects” in the titertools package, 
with parameter “sigma = 0.62,” according to 
our estimates of the magnitude of measure- 
ment noise in a single measurement (fig. S10). 

Antigenic cartography has its own method of 
dealing with serum reactivity bias. The formula 
for the target distance (provided in “Antigenic 
cartography” below) indicates that for any 
serum, the antigen with the highest titer (usually 
the homologous antigen, but not always) is auto- 
matically adjusted to have O distance to that 
serum. This is true regardless of the value of the 
maximum titer, and this adjustment also takes 
care of the individual serum reactivity in a way 
that is compatible with antigenic mapmaking. 


Calculating fold-drop differences 
in vaccine sera 


When exploring differences between postin- 
fection and different postvaccination responses 
homologous to the same antigen, one might 
expect that the fold drop in serum responses 
(compared with the homologous antigen) are 
similar in some way. A simple model one can 
use is to introduce an overall factor that re- 
lates the responses of sera from the two groups. 
In terms of the model in the previous section, 
this means 


Six) = dAsni® 


Shy — Sit = Ay (Sn 


where K and J are two serum groups, / is the 
homologous antigen to them, 7 is any other 
antigen, and s,y are as described in the previous 
section. Note that terms such as s,7 — S;7 give 
the (noiseless) fold drop of titers of antigen 7 
from the titers of the antigen 2 (homologous 
to J), measured against the serum group J. 
Then for a specific serum j from the serum 
group J, we have 
try — tig = Sar — Sit + Eng — EG 
= dyASix + Eyn 


where the last term is again normally distrib- 
uted however with \/2 times the SD of the 
titer noise. 
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An example would be K = D614G postinfec- 
tion serum group, J = mRNA-1273 serum group, 
h = D614G antigen, and 7 = any other antigen 
(Fig. 2). The term d, is the “factor” specific to 
serum group J, which relates its responses to 
the response of serum group K. (This term 
obviously also depends on K; however, we 
imagine that K is a constant postinfection 
serum group and that J varies between differ- 
ent postvaccination serum groups, so by abuse 
of notation, we only index this term with J.) 
Note that taking difference of titers cancels 
out the serum response bias specific to serum). 

When applying this model, we compared the 
serum groups D614G, 4 weeks post 2x mRNA- 
1273, >3 months post 2x mRNA-1273, and 4 weeks 
post 3x mRNA-1273 (with shared homologous 
variant D614G), and we compared serum groups 
B.1351 and 2x mRNA-1273.351 (with shared 
homologous variant B.1.351). In the D614G and 
B.1.351 postinfection groups, we fixed d, as 1, 
estimating fold-change differences in the post- 
vaccination groups retive to the postinfection 
sera. For the fitting procedure, we used the func- 
tion “compare_crossreactivity_breadth” in the 
titertools package, with confidence intervals 
calculated for the parameters based on the 
highest density interval of 5000 sampling runs 
after 1000 warmup runs. 


Imputing censored titers 


For figs. S11, S12, and S20, we imputed nonde- 
tectable titers in order to not bias the dis- 
tributions that are visualized. In these cases, 
censored titers (e.g., <20) are imputed using the 
function “impute_gmt_titers” from the titer- 
tools package. In these cases, censored titers 
are replaced with numeric titers drawn from a 
censored normal distribution with mean and 
SD parameters that are estimated from the titer 
data as described in the “Calculation of GMT” 
section above. 


Antigenic cartography 


Antigenic cartography is a multidimensional 
scaling method (MDS) (68) for the reduction of 
dimensions in large-titer datasets in order to 
not only visualize large amounts of titer data 
in a lower-dimensional space but to also draw 
conclusions about the antigenic evolution of 
pathogens by studying the positions occupied 
by these viruses on the antigenic map. 

In order to construct an antigenic map, one 
starts with a titer table where columns are sera 
and rows are antigens. Denoting the values of 
such a table by ¢;; (where 7 denotes the antigen 
index and j the sera index), titer values repre- 
sent a degree of dissimilarity of the 7th antigen 
to the jth serum. We turn this dissimilarity 
score into a similarity score by the following 
transformation: 


Dy = mae [logs (ty)] — logs (ty) 
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The D,; values are called the “target distances.” 
Antigenic cartography strives to find a multi- 
dimensional (usually two or three dimensions) 
placement of antigens and sera, such that the 
Euclidean distance between antigens and sera 
conform as well as possible to the target dis- 
tances. This is achieved by an optimization of 
the antigen and serum coordinates that mini- 
mize the stress function 


Ys - dy) 

ij 
where d, represents the Euclidean distance 
between the 7th antigen and the jth serum 
points in the antigenic map. The minimization 
is achieved using the L-BFGS algorithm (69) to 
find the best cost-minimizing coordinates start- 
ing from a set of randomly determined initial 
conditions. Note unlike most common MDS 
problems, the similarity matrix is not square 
and might contain masked or censored values 
and therefore requires a slightly custom ap- 
proach. In particular, if ¢; is a nondetectable 
titer, then the stress is computed as 


(Diy — diy + 8)(Dyy — dy + 8)” 
where s is a step-size factor which is 1 for 
discrete data (and their means) such as those 
obtained from Haemagglutinin Inhibition as- 
says, or O when the output data are continuous 
and measured from titer curves, and 
1 

99) = 0 
is a damping function whose role is to give more 
freedom to nondetectable titers to be opti- 
mized for a value less than the limit of detec- 
tion of the assay. 

We used the Racmacs package to compute 
the antigenic maps presented in this manu- 
script (70). The map was constructed using 
1000 optimizations, dilution step size of 0, with 
the minimum column basis parameter set to 
“none.” In order to allow for accurate coordina- 
tion of serum positions in two dimensions, sera 
with less than three detectable titers were ex- 
cluded from the maps. An exception was made 
for the BA.2 serum sample, which was included 
despite only having two detectable titers because 
it was the only serum of its group, although the 
serum position for this serum will therefore 
be less constrained in its antigenic position. 


Assessing model fit 


We performed different analyses to ascertain 
that the 2D antigenic map is a good represen- 
tation of the titer data and to ensure that the 
map is robust to noise from measurement errors. 

First, to get an understanding of the mea- 
surement error of individual titrations, we 
analyzed a set of repeat titrations of 60 sera 
against eight of the variants used in the assay 
and found that the mean difference in log, 
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titers between repeated titrations was 0.209 
with a SD of 0.84, with measurement error 
being approximately normally distributed (fig. 
$10). Accounting for the systematic bias of ti- 
trations between repeats, and assuming a mean 
of 0 without it, resulted in a total SD of 0.87 
on the log, scale. Since measurement error 
would be present for both the first and sec- 
ond repeat titration, this would be consist- 
ent with noise per measurement with a SD of 

0.87" /2 = 0.62. 

Subsequently, we investigated how well anti- 
genic maps of different dimensions represent 
the measured patterns of serological reac- 
tivity. In our previous experience with other 
antigenically variable pathogens, the dimen- 
sionality of relationships can be lower [for 
example, two dimensions in seasonal influ- 
enza and dengue virus (13, 20)] or higher [for 
example, three dimensions for A/H5N1 and 
swine influenza (17)] or may simply not be well 
represented in Euclidean space. Under cross- 
validation, a 2D map represented the data best 
(fig. SQA). The overall arrangement of the var- 
iants and sera in three dimensions is similar to 
that in two dimensions (fig. S9, B andC), with 
the exception of the BA.4/BA.5 variant, which in 
three dimensions occupies a position more dis- 
tant from BA.1 and closer to B.1.617.2 and B.1.351 
than visualized in two dimensions. The dimen- 
sional analysis was performed with Racmacs by 
using the function “dimensionTestMap.” 


Assessing goodness of fit using fitted 
and measured titers 


Next, we assessed how well the 2D map fits the 
measured titers. We first investigated whether 
the measured log, titers correlated with the 
titers fitted in the antigenic map. Overall, the 
fitted and measured titers corresponded well 
(fig. S11). When comparing the mean differ- 
ences of the fitted versus measured titers (mean 
of 0.04 and SD of 1.05, shown as a histogram in 
fig. S12), they are comparable to the estimated 
size of measurement error (fig. S10). This sug- 
gests that most of the residual error observed in 
the antigenic map fit of the data are consistent 
with measurement error but that not all var- 
iation in the titers is captured. 

We investigated whether any variant and 
serum group combinations exhibited consistent- 
ly larger differences between fitted and mea- 
sured titers (fig. S13). We find some variants 
showing differences in fitted versus measured 
titers in some serum groups, such as the B.1.617.2 
variant having larger measured than fitted ti- 
ters against the B.1.617.2 sera. As discussed in 
the main text, we speculate that when compared 
with other datasets (fig. S7), our measurements 
of B.1.617.2 serum antibodies against B.1.617.2 
were biased toward higher values. Also as de- 
scribed in the main text, in cases such as the 
BA.4/BA.5 variant, a third dimension is re- 
quired to better reflect the measured patterns 
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of reactivity. However, some of these system- 
atic residual errors in the map fit will also be 
reflective of the fact that not all patterns of 
cross-reactivity between the serum groups can 
be captured in the constraints of the Euclidean 
space of an antigenic map. In particular, some 
limitations may relate to the immunodomi- 
nance differences identified in this paper, where- 
by one serum may see an antigenic difference 
between two variants and one serum may not. 
Antigenic cartography allows for this to some 
extent by permitting serum positions not to be 
coincident with homologous antigen positions. 
This means that a serum point can be posi- 
tioned directly between two variants for which 
it sees no difference, thus being antigenically 
equidistant from both. However, this cannot 
be reconciled in cases where a serum has max- 
imum reactivity to two variants and hence an 
expected antigenic distance of 0 between them. 
Overall though, we judge that the 2D map is a 
reasonable representation of the broader scale 
underlying antigenic relationships among var- 
iants, providing a useful complement to raw 
titer analysis, and in some cases may compen- 
sate for systematic titer biases like that of P.1 
that are otherwise spurious. 


Assessing the robustness of variant 

and serum positions to measurement error 
and presence or absence of variants 

and serum groups 


We performed three experiments to determine 
the uncertainty of the positions of variants and 
sera in the antigenic map. First, we estimated 
how titration measurement error and varia- 
bility in antigen reactivity would affect con- 
clusions about antigenic map positions (fig. 
S14). We performed a “smooth” bootstrap with 
normally distributed noise added to the titer 
measurements and/or to antigen reactivity. 
Noise added to the titer measurement had a 
SD of 0.62, in keeping with the SD of repeat 
titrations shown in fig. S10, and random noise 
for antigen reactivity had a SD of 0.4, indi- 
cating that systematic biases in titers like that 
suspected for P.1 may also be present to some 
degree for other strains in the assay. We per- 
formed 500 repeats of adding random noise 
(using the Racmacs function bootstrapMap, 
method = “noisy”) and summarized the variant 
positions by showing the area in which 68% 
(one SD) of the positional variation of a serum 
or antigen is captured. In general, we found 
that the antigen and serum positions are ro- 
bust to both the random and systematic noise 
added, without affecting the main conclusions 
about the antigenic relationships between the 
variants included. 

Next, we assessed whether the positions of 
the variants are robust to the exclusion of each 
variant in turn. We find that positions of var- 
iants are robust to the exclusion of variants 
(fig. S15), with the small movement observed 
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with the exclusion of D614G and B.1.351. We 
then investigated whether positions of var- 
iants are robust to the exclusion of each serum 
group in turn (fig. S16). Overall, the map was 
robust to the exclusion of single serum groups, 
with the greatest movement observed when ex- 
cluding the 2x mRNA-1273.351 vaccine serum 
group and the B.1.617.2 and BA.1 convalescent 
serum groups. Relatedly, since we incorporated 
the information from the different post-mRNA- 
1273 vaccine serum groups into the map, we 
also explored how the antigenic map compared 
if constructed from postinfection sera alone 
(fig. S17). For some variants that had not been 
titrated against post-D614G infection sera (and 
were thus less well coordinated without includ- 
ing post-D614G vaccination sera titrations), we 
did see larger discrepencies in antigenic posi- 
tion in the postinfection sera-only map, in par- 
ticular for the BA.4/BA.5 variant. (fig. S17A). For 
other variants, antigenic relationships were 
consistently well described in the presence or 
absence of postvaccination sera (fig. S17B). We 
also included the outlier sera identified in fig. 
82 and found little difference in map topology 
(fig. S19). 

We also evaluated the robustness of map 
positions on particular titrations by perform- 
ing 500 “Bayesian bootstrap” repeats, where in 
each repeat, titers were reweighted accord- 
ing to a dirichlet distribution. This method is 
statistically similar to the more common meth- 
od of bootstrapping using random sampling 
with replacement, but it ensures that titers are 
never fully removed from the dataset, which 
can happen when sampling with replacement 
and lead to underconstrained antigenic map 
solutions in some of the bootstrap repeats. We 
performed this reweighting on both variants 
and sera combined, as well as antigens and 
sera separately, using the Racmacs function 
bootstrapMap, method = “bayesian” (fig. S20). 
Map positions were largely robust to this re- 
weighted sampling of titers. 

We also show that the error is approximate- 
ly equally distributed between all variants and 
sera (fig. S21A). Positions of variants and sera 
are also robust when allowing the variant and 
sera to move in the map while not increasing the 
overall map root mean square error (RMSE) 
between map and table distance by more than 
one unit (fig. S21B). 


Assessing predictive power of the 
antigenic map (cross-validation) 


To investigate whether the antigenic map is 
able to predict missing titers, we performed 
500 cross-validation replicates, where in each 
replicate we removed 10% of all titrations, and 
predicted the missing titers from the result- 
ing map. On the logs scale, where difference 
of 1 represents a predicted titer that is 2-fold 
higher than the measured titer and —1 repre- 
sents a titer that is twofold lower, the mean 
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difference between predicted and measured 
titer was 0.53 with a SD of 1.61 (fig. S22). We 
also considered the difference between pre- 
dicted and measured titers split by variant 
and serum group (fig. $23). As for figs. S12 
and S13, these results imply that not all pat- 
terns of cross-reactivity between the serum 
groups can be captured in the constraints of 
the 2D Euclidean space of the map but that 
overall, the 2D relationships are largely pre- 
dictive of titers among sera and variants. 


Construction of the antibody landscapes 


The construction of the antibody landscapes 
differs from the methodology used in previous 
publications (34, 77) , since each serum repre- 
sents a likely first infection and was used to 
construct the map itself with explicit assump- 
tions about how reactivity should be distributed. 
Given a serum with coordinates (2m, Ym), let tm 
be its maximum log, titer/10. Then (2m, Ym, tm) 
represents the highest point of the antibody 
landscape for this serum. Given any other 
point p= (a#,y) on the 2D base space, the 
third coordinate, t, of the surface at this point 
is given by 


t= tin _ d(D, Pm) 


where d(p, Pm) is the Euclidean distance be- 
tween p and p». In particular, the serum point 
and maximum titer for a given serum describe 
how reactivity would be expected to vary in a 
cone-like fashion across antigenic space, with 
its apex at the serum point at a height equal 
to the maximum measured log, titer/10 and 
diminishing at a rate of a twofold reduction 
in titer with each antigenic unit on the map. The 
serum group averages shown in Fig. 4 therefore 
simply represent an average of all these indi- 
vidual slope = 1 cone-like responses for the 
relevant sera, displaying the GMT of all the 
individual titer predictions for the group. 

In this case, since samples were collected 
after a presumed single exposure, individual 
landscapes simply visualize the slope 1 cone-like 
model of reactivity that is fit when construct- 
ing the antigenic map, rather than refitting the 
same data with a nonparametric modeling ap- 
proach that can also be applied. 
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Transonic dislocation propagation in diamond 


Kento Katagiri"?*+5*, Tatiana Pikuz®, Lichao Fang**°, Bruno Albertazzi’, Shunsuke Egashira’, 
Yuichi Inubushi®®, Genki Kamimura’, Ryosuke Kodama‘, Michel Koenig”’, Bernard Kozioziemski?°, 
Gooru Masaoka’, Kohei Miyanishi®, Hirotaka Nakamura’, Masato Ota’, Gabriel Rigon’}, 

Youichi Sakawa’, Takayoshi Sano“, Frank Schoofs!”, Zoe J. Smith’, Keiichi Sueda®, 

Tadashi Togashi®®, Tommaso Vinci’, Yifan Wang**°, Makina Yabashi®°, Toshinori Yabuuchi®®, 


Leora E. Dresselhaus-Marais**°, Norimasa Ozaki? 


The motion of line defects (dislocations) has been studied for more than 60 years, but the maximum 
speed at which they can move is unresolved. Recent models and atomistic simulations predict the 
existence of a limiting velocity of dislocation motion between the transonic and subsonic ranges at 
which the self-energy of dislocation diverges, though they do not deny the possibility of the transonic 
dislocations. We used femtosecond x-ray radiography to track ultrafast dislocation motion in 
shock-compressed single-crystal diamond. By visualizing stacking faults extending faster than the slowest 
sound wave speed of diamond, we show the evidence of partial dislocations at their leading edge 

moving transonically. Understanding the upper limit of dislocation mobility in crystals is essential to 
accurately model, predict, and control the mechanical properties of materials under extreme conditions. 


otion of dislocations inside a material 
caused by external stress is related to the 
material’s mechanical properties and 
its deformation dynamics (J, 2). When a 
ductile material is stressed, dislocations 
inside the material move to locally accommo- 
date that force, giving rise to plasticity (3). The 
ductility common to metals is usually absent in 
brittle materials such as diamond, but even brit- 
tle materials can exhibit ductility under some 
types of extreme conditions like shock-induced 
high strain-rate deformation (4-6). Although 
the basic mechanisms of dislocation-mediated 
plasticity are sometimes invariant to the rate of 
strain in the material as it responds to external 
stresses (7), numerous studies have observed 
cases that show high rate sensitivity (8). 
At the highest strain rates, dislocations move 
during deformation at velocities that approach 
the sound speeds of the material. Dislocation 


Fig. 1. Femtosecond x-ray A 
radiography on shocked 
diamond. (A) The diamond 
sample is stressed by 
shock waves driven by 
optical laser irradiation. 

In situ x-ray radiography 

is measured by using a 
femtosecond XFEL pulse, 
irradiating the sample 
perpendicular to the 
propagation axis of the 
shock waves. A lithium 
fluoride (LiF) crystal 


Diamond 


theory predicts that the self-energy and stress 
of dislocations diverge when a dislocation ap- 
proaches a limiting (or critical) velocity in a 
given crystal (9-12). This implies that disloca- 
tions are forbidden to travel at those limiting 
velocities. Whereas the limiting dislocation ve- 
locities in an isotropic crystal coincide with the 
longitudinal and transverse sound speeds (73, 14), 
anisotropic single crystals such as diamond have 
one longitudinal (c,) and two transverse (c; = ¢3), 
sound speeds, which are the same order of mag- 
nitude but do not always coincide with the lim- 
iting velocities (10-12, 15-18). Critical velocities 
offset from the sound speeds are attributed to the 
orientation of a dislocation’s motion (15). The 
slowest sound speed, ¢3, defines the separation 
between subsonic and transonic speed regimes. 

To date, transonic or supersonic (i.e., faster 
than cs or ¢,, respectively) dislocation motion 
in a real crystal has not been observed exper- 


LiF detector 


[110] 


Drive Laser Mo} (007) 


imentally. The only reported experimental i 
dence of dislocations moving faster than’ ~~ 
slowest limiting velocity was in a plasma crystal 
(19). By contrast, numerous theory and molec- 
ular dynamics (MD) simulation studies have 
predicted the existence of transonic or even 
supersonic dislocation motion, indicating that 
the limiting velocities should not be the up- 
per limit of dislocation motion (9, 20-26). 
Gumbsch and Gao (9) used atomistic simu- 
lations to observe ultrafast dislocation motion 
in tungsten. Their simulations showed that 
stable motion of dislocations is possible for 
both transonic and supersonic speeds, but only 
for dislocations created above the limiting speed, 
thus avoiding the need to accelerate across the 
infinite-energy limiting velocity. 

Although creating dislocations at such high 
speeds is relatively straightforward for MD 
simulations, experiments that measure dis- 
locations have not been able to access those 
rapidly driven conditions. With >10’ s“ strain 
rates, the shock compression technique offers 
a distinctive system for studying high-velocity 
dislocations because the energy discontinu- 
ity at the shock wavefront can create disloca- 
tions that initially move faster than the limiting 
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is placed at the XFEL downstream to collect the spatial intensity distribution of the transmitted x-rays. (B) A representative image that we captured by using 


the LiF crystal to visualize the elastic-plastic shock wavefronts propagating along the shock direction. Behind the plastic wavefront, stacking faults appear as dark 
and light bands, indicating the existence of partial dislocations that travel with the plastic shock wavefront (magenta arrow). The yellow dotted line serves to 


guide the eyes to the radiation. 
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velocity (15, 27). Here we present experimental 
results that show shock-induced dislocation 
motion moving transonically in single-crystal 
diamond, using femtosecond x-ray radiogra- 
phy. We discuss our interpretation and as- 
signment of the relevant image features and 
include a discussion of how the dislocation 
speeds and related plasticity indicate radiation 
and drag that imply a relationship between 
the microstructure and bulk elastic-plastic 
behaviors. Understanding the speed of the 
fastest dislocation motion is required to ac- 
curately predict and control the dislocation 
dynamics and plasticity of deformed solids, 
which can be modeled by discrete dislocation 
dynamics (28, 29). Such ultrafast motion of 
dislocations strongly affects the mechanical 
responses of materials in ways that are es- 
sential across numerous applications. 


Visualization of stacking faults 


Our experiments were conducted using the 
X-ray Free Electron Laser (XFEL) at SPring-8 
Angstrom Compact Free Electron Laser (SACLA). 
At SACLA’s Experimental Hutch 5, a high- 
intensity nanosecond-pulsed optical laser is 
aligned to the same spatial position as the 
femtosecond-pulsed XFEL, and its timing is 
synchronized to perform in situ x-ray mea- 
surements of laser-shocked materials (30). We 
used type Ila single crystals of diamond syn- 
thesized by chemical vapor deposition. Our 
diamonds were cut along two different ori- 
entations to enable our shock to propagate 
along the [100] and [110] directions, to examine 
the orientational difference in the plasticity me- 
diated by shock-induced defects (see fig. S9 for 
[111] shock data). We estimate the peak shock 
stresses applied to the diamond samples to be 
184 + 16 GPa and 92 + 15 GPa for [100] and 
[110] shock directions, respectively (78). These 
shock stresses are high enough to make dia- 
mond yield (5), generating strong shear stresses 
that initiate failure mechanisms. When diamond 
yields during shock compression, the shock 
wave’s front splits into two: a leading elastic 
front that is followed by the plastic front in 
which dislocation propagation occurs. 
Although several x-ray imaging techniques 
are used for in situ measurements (31-34), x-ray 
radiography is one of the original ones used to 
nondestructively visualize structures in bulk 
materials (35). X-ray radiography spatially maps 
the x-ray intensity transmitted through a ma- 
terial of interest, giving image contrast that 
is usually based on the absorption by the sam- 
ple. The contrast that we observe in this work 
is mainly due to the scattering of the x-rays 
rather than absorption; despite the similarity 
in contrast mechanism, our radiography set- 
up is distinct from the phase contrast imaging 
setups used in other XFEL shock experiments 
(32). To perform the in situ x-ray radiography, 
we used an unfocused and single-pulsed XFEL 
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Fig. 2. Visualization of stacking faults. (A) X-ray radiography images of the diamond shocked along the 
[100] direction. The numbers on each image indicate the XFEL probe timing relative to the drive laser 
irradiation. The vertical lines that appear across the entire field of view are the texture on the unpolished 
lateral surfaces of the diamond. The two large circles with concentric scattering patterns are caused by 
scattering from the debris on a beryllium window upstream of the XFEL. (B) Undistorted diamond lattice 
structures (37) with the corresponding plane orientations. (C) Fast Fourier transform (FFT)-filtered image 
showing the stacking faults observed in the 16-ns shot. The corresponding area is indicated as the yellow 
dashed rectangle in (A). The orange arrows serve to guide the eyes to one of the bands formed nonparallel to 
the diamond {111} planes (see text). (D) X-ray radiography images when diamonds are shocked along 
[110] direction. The 12-ns image is identical to the one shown in Fig. 1B. (E) Undistorted lattice structure 
showing the plane orientations. (F) FFT-filtered image of the [110] shock at 16 ns. The errors shown 

on the angles noted in (C) and (F) represent lo of multiple measurements (18). 


beam with a photon energy of 7.0 keV and 
pulse duration of ~8 fs to illuminate the dia- 
mond along the axis perpendicular to the shock 
propagation direction (Fig. 1). We placed a 
lithium fluoride (LiF) crystal downstream of 
the sample along the XFEL beam as a vacuum- 
compatible imaging detector that offers a >10° 
dynamic range and ~1-um spatial resolution 
over a wide field of view (36). The distance be- 
tween the diamond sample and the front sur- 
face of the LiF crystal was 112 mm. 

Our x-ray radiography for a diamond shocked 
along the [110] direction captures two shock 
wavefronts, corresponding to a preceding elas- 
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tic wave and a following plastic wave, travers- 
ing the diamond sample from the bottom to 
the top of the image (Fig. 1). Behind the plastic 
wavefront, light and dark bands appear diago- 
nal to the shock direction. In all cases, the two 
characteristic vectors of the shock wave and the 
XFEL imaging beam’s direction indicate the an- 
gles of the linear banded features appear parallel 
to the {111} planes of the crystal. When the {111} 
planes are set not along the XFEL’s probing 
direction (i.e., the sample was placed 90° rotated 
about the shock propagation axis), these banded 
features appear much fainter (fig. S10). This in- 
dicates that the image contrast of the banded 


2 of 4 


RESEARCH | RESEARCH ARTICLE 


> 


300 
—@— [100]: vy = 16.5+2.0 um/ns 
—— [110]: vy = 19.540.8 pm/ns 
250+] 


2007 


1507 


1005 


507 


Dislocation propagation distance, x, [um] 


XFEL Delay, t [ns] 


Supersonic 


2"¢ Transonic 


Dislocation velocity, vy [km/s] 


16- [100] 
144 c 
1 Transonic 
124 C3 
Subsonic | 
10 T T T 
3.5 4.0 4.5 5.0 5.5 


Density, e [g/cm] 


Fig. 3. Transonic dislocation motion in shocked diamonds. (A) Measured dislocation propagation 
distances in diamond (x,y) as a function of the XFEL delay (t). The xy = O um denotes the interface between 
the ablator and diamond, whereas t = 0 ns denotes the time that the laser hits the ablator. The blue 
circles and red squares represent our experimental results for the [100] and [110] shock directions, 
respectively. The velocities of the dislocation propagation (vq) noted are obtained from the slope of the linear 
fits (solid lines). The error bars on each plot represent lo of multiple measurements. The shot-by-shot 
fluctuation of drive laser intensity is not included in the error bars shown on xg. The errors on vg are evaluated 
on the basis of the lo of the fitting and the errors on each plot. (B) va versus the diamond's material density 
(p) at the shocked states. Black curves are the calculated c), c2, and c3 of diamond propagating along [110] 
direction (18). The error bars on p are propagated from the errors on the plastic shock wave velocities (18). 


features in our x-ray radiography is strongest 
when the x-ray beam direction integrates along 
a plane perpendicular to the two-dimensional 
(2D) features formed in the diamond. 

We used Fourier filtering methods to em- 
phasize the linear image features (Fig. 2, C and 
F). These banded features in the [110] shock 
images appear thicker and darker than those 
seen in the [100] shock images. As described 
in the next section, the dislocation propaga- 
tion for the [110] shock is faster than that for 
the [100] shock, which potentially affects the 
thickness and darkness of the stacking faults 
as a faster dislocation propagation results in 
faster macroscopic plastic flow (38), causing 
more damage to the material. We measured 
the angles between the linear features and the 
diamond surface using Hough transforms 
on the Fourier-filtered images (18). The ob- 
served angles of the 16-ns images are 56° + 3° 
and 32° + 3° for [100] and [110] shocks, re- 
spectively. Whereas the value for [100] agrees 
with the angles between the diamond surface 
and the {111} plane of the undistorted diamond 
lattice, which is 55° (Fig. 2B), the observed an- 
gle for [110] shows a relatively larger deviation 
from the angle of the undistorted diamond 
lattice, which is 35° (Fig. 2E). Although the 
deviation is within the estimated error, we be- 
lieve this deviation suggests that the diamond 
shocked along [110], experiencing a plastic de- 
formation with some distortion caused by 
the uniaxial shock compression. We observed 
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bands along non-{111} directions only in the 
[100] shock image collected at 16-ns delay (Fig. 
2C, orange arrows). The angular offset may in- 
dicate shock-induced twinning or slip along an- 
other crystallographic plane, though we cannot 
exclude the possibility of the diamond crystal 
having an undesirable formation of a rotated 
grain during the synthesis. 

We interpret these deformation-induced bands 
as accumulated stacking faults (39) that form 
along the {111} planes. The {111} plane in dia- 
mond is known to be the dominant slip plane 
for dislocations. Previous shock-loading experi- 
ments in other brittle materials have reported 
the occurrence of shock-induced amorphous 
bands formed diagonally to the shock wave- 
front (40-44), but the estimated temperature 
in the shocked diamond in our work is less than 
20% of its melting temperature (78), which we 
interpret as too low for amorphization. The pos- 
sibility of phase transformation is also negligible 
because the applied stress in our experiments is 
far below the phase transition point (45, 46). 


Transonic dislocation propagation 


Elastic theory predicts that transonic disloca- 
tions can be generated at discontinuities in 
energy or displacement in a material, as would 
be present at a shock-wave front (15, 27). These 
conditions can cause a full dislocation to split 
into partial dislocations, leaving a stacking fault 
between them (47, 48). As the edges of a stack- 
ing fault are defined by partial dislocations 
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(49, 50), the leading edge of the stacking-fault 
extension that we observe gives the dislocation 
velocity in our experiment. In our images, the 
stacking faults formed in the diamonds appear 
as discontinuous lines, indicating that the dis- 
locations travel with the plastic wavefront, orig- 
inating from the laser-irradiated side of the 
diamond surface. A closer look at the [110] shock 
images (Fig. 1B) shows the absence of these 
bands at the center of the plastic wavefront. 
The reason for this observation is that most 
of the dislocations traveling with the plastic 
wavefront are propagated from the limited 
area of the diamond surface that is shocked 
by the drive laser (@ ~ 260 um). This indicates 
that the number of dislocations freshly created 
at the propagating plastic shock front is much 
less than those propagating from the diamond 
surface. 

We plot the length of the stacking faults that 
we observe in our images (#g) as a function of 
time delay after the shock initiation (¢) (Fig. 3A). 
Then we quantify the velocity of the partial dis- 
locations (vq) leading that stacking-fault ex- 
tension by fitting the points to a line with vg = 
dxq/dt from the measured #4-t relationship. We 
show the obtained dislocation velocities (Fig. 
3B) along with the sound-wave velocities (¢;, Co, 
and cs) of diamond at various densities (78) 
calculated from the pressure-dependent elas- 
tic constants at 1100 K from (57). The dislocation 
velocities that we observe for the [100] and 
[110] shock directions both lie in the transonic 
regime (i.e., between c, and c3). We find that 
the velocity of dislocations in the [110] shocked 
crystals is faster than those of the [100] crystal. 
Although the plastic wave velocities differ only 
slightly, the angle between the shock direction 
and dislocation propagation direction is larger 
for the [110] shock than the [100] shock, making 
the dislocation propagation distance longer (and 
thus faster) for the [110] to catch up with the 
plastic shock wavefront. 

The existence of supersonic dislocation has 
been observed in several MD simulations though 
some of the early theoretical studies predicted 
it to be impossible (27, 52). Because the shock- 
induced dislocation motion that we observe 
in diamond travel with the plastic shock wave- 
front, applying a higher shock stress to drive a 
faster plastic shock wave can potentially drive 
even faster dislocation motion. This will allow 
one to experimentally investigate the possible 
existence of supersonic dislocations in a real 
crystal. 


Radiation from transonic dislocations 


A close look at the elastically compressed vol- 
ume immediately preceding the plastic wave- 
front reveals the presence of multiple “elastic 
pulses” (Fig. 1B, yellow dotted line). These pulses 
are referred to as radiation (or phonon ra- 
diation) in dislocation theory (27). Although 
the emission of radiation from transonically 


3 of 4 


RESEARCH | RESEARCH ARTICLE 


moving dislocations has been predicted by dis- 
location theory and observed in MD simulations 
(9, 23, 24), a dearth of validation methodology 
has precluded experimental observation of the 
radiation. 

Although we observe the radiation feature 
in our images of the [110] shock (Fig. 2D), our 
images for the [100] (Fig. 2A) and [111] shock 
directions (fig. S9) show less clear signs of ra- 
diation. The different amounts of radiation 
that we observe for shocks along different di- 
rections with different dislocation velocities 
should be due to the strong velocity depen- 
dence on the energy dissipation in radiation, 
which has also been predicted by the theory 
and simulations (9, 53). Because the radiation 
from the transonically propagating dislocation 
cores exerts a negative driving force that adds 
drag to the dislocation motion, large external 
stress is required to drive the stable motion of 
transonic dislocations without deceleration 
(9). The use of strong shock waves can provide 
stress that is high enough to overcome the drag 
by radiation to achieve stable transonic mo- 
tion of dislocations. 

Although the existence of the radiation from 
transonically propagating dislocation cores 
and its role in the energy dissipation have been 
extensively studied, its effect on the shock-induced 
deformation dynamics remains largely unknown. 
A few studies have used MD simulations of 
shocked single-crystal aluminum (54, 55) to 
predict shock-induced elastic-plastic deforma- 
tion mechanisms wherein the preceding elas- 
tic volume was disturbed by the plasticity that 
followed. Their observations were explained by 
the elastic pulses emitted from the dislocations 
at the plastic wavefront, consistent with the con- 
figuration that we observe in this diamond 
study. A recent billion-atom MD simulation of 
single-crystal diamond shocked along the [110] 
axis revealed complex defect structures at a 
peak stress that was slightly higher than 400 GPa 
(56). Such dislocation-driven elements of shock 
deformation dynamics might explain the com- 
plex elastic-plastic correlations in shocked dia- 
mond (57) and the anomalous elastic behavior 
in shocked silicon (58) that were observed in 
previous experiments by using a conventional 
velocimetry. 

In conclusion, our in situ x-ray radiography 
using one of the world’s brightest x-ray lasers 
shows experimental evidence of transonic dis- 
location motion in diamond, leading to the 
formation of numerous stacking faults that 
span the plastically deformed volume. The mi- 
croscopic dislocation motions that emit radia- 
tion might affect the macroscopic elastic-plastic 
deformation dynamics. Our experimental re- 
sults showing transonic dislocation motion 
open key new opportunities to refine models 
for insights into the ultrafast deformation be- 
haviors at these extreme conditions. The newly 
refined models at the highest strain rates will 
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have a marked influence on numerous fields, 
including ultrafast fracture in structural ma- 
terials (69-61), prediction and analysis of earth- 
quake ruptures (62-65), precise manufacturing 
processes (66), and functionalities in electro- 
chemical applications (67). 
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HUMAN EVOLUTION 


Independent age estimates resolve the controversy 
of ancient human footprints at White Sands 
Jeffrey S. Pigati!*}, Kathleen B. Springer’, Jeffrey S. Honke!, David WahI2?, 


Marie R. Champagne?, Susan R. H. Zimmerman’, Harrison J. Gray’, Vincent L. Santucci®, 
Daniel Odess°+, David Bustos’, Matthew R. Bennett® 


Human footprints at White Sands National Park, New Mexico, USA, reportedly date to between ~23,000 
and 21,000 years ago according to radiocarbon dating of seeds from the aquatic plant Ruppia cirrhosa. 
These ages remain controversial because of potential old carbon reservoir effects that could compromise 
their accuracy. We present new calibrated ‘4C ages of terrestrial pollen collected from the same 
stratigraphic horizons as those of the Ruppia seeds, along with optically stimulated luminescence ages 
of sediments from within the human footprint-bearing sequence, to evaluate the veracity of the seed 
ages. The results show that the chronologic framework originally established for the White Sands footprints 
is robust and reaffirm that humans were present in North America during the Last Glacial Maximum. 


ncient footprints in White Sands Natio- 

nal Park, New Mexico, USA (WHSA), 

appear to place humans in North America 

during the Last Glacial Maximum (7). The 

footprints and associated tracks of Pleis- 
tocene megafauna at WHSA Locality 2 are 
imprinted on multiple stratigraphic horizons 
composed of fine-grained, gypsum-rich alluvium 
intercalated with clay and silt that was deposited 
in a mosaic of wet and dry environments along 
the eastern margin of Paleolake Otero in the 
Tularosa Basin (Fig. 1). Seeds from the aquatic 
plant Ruppia cirrhosa are found in situ within 
thin clay laminae, either interbedded with the 
footprint horizons or embedded in the foot- 
prints themselves. A total of 11 aliquots of 
Ruppia seeds yielded calibrated “C ages that 
maintain stratigraphic order and range from 
~23 to 21 ka (ka, thousand years before pres- 
ent; 0 years = 1950 CE) (J), constraining the 
human footprints to within the temporal bounds 
of the Last Glacial Maximum (~26.5 to 19 or 
20 ka) (2). 

The findings at WHSA Locality 2 push back 
the peopling of the Americas by thousands of 
years and imply that early inhabitants and 
megafauna coexisted for several millennia be- 
fore the terminal Pleistocene extinction event 
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(3). This has important consequences for ar- 
chaeology and allied fields, yet there has been 
widespread debate as to the accuracy of the 
Ruppia ages and therefore the antiquity of the 
footprints. This debate is organized around 
two lines of reasoning. The first is that R. 
cirrhosa is an aquatic plant, so there is the 
potential for uptake of old carbon from ground- 
water, which would cause the seed ages to be 
too old, a phenomenon known as the “hard- 
water effect” (4, 5). The second is that Ruppia 
seeds are physically robust and can be pre- 
served in sediments over geologic timescales, 
so it is possible that the seeds were exhumed 


4 


from older deposits and reworked into the ; oi 
ments at WHSA Locality 2. If so, the seeds wo — 
provide only maximum-limiting ages for the 
human footprints (6, 7). 

To address the controversy regarding the 
ages derived from Ruppia seeds, we obtained 
radiocarbon ages of terrestrial pollen recovered 
from the same stratigraphic intervals as those 
of the seeds, as well as optically stimulated 
luminescence (OSL) ages of quartz grains from 
within the footprint horizon interval, to evaluate 
the chronology of WHSA Locality 2. The dating 
techniques, the sample types, and the acceler- 
ator mass spectrometry facility involved in 
this study are independent of those used by 
Bennett ez al. (1). 

Terrestrial pollen is produced by cone-bearing 
and fruit-bearing plants living on land and 
fixing atmospheric carbon and therefore is 
not subject to hard-water effects. However, 
pollen grains are extremely lightweight and 
are generally between 10 and 150 um in diam- 
eter (8), meaning that tens or even hundreds 
of thousands of grains are required to achieve 
the mass necessary for a single radiocarbon 
measurement. Even though it is an exceptionally 
laborious process, researchers have attempted 
to date pollen for decades (9-17), but the results 
have been mixed largely because of the diffi- 
culty of isolating enough pollen grains from 
other organic material to produce reliable 
ages. The recent adoption of flow cytometry 
to separate pollen grains from sediment and 
other organic materials has overcome this 


Fig. 1. Ancient human footprints found at WHSA Locality 2. Photograph showing multiple in situ 
footprints in track horizons TH1 (white arrows), TH2 (yellow arrows), TH3 (red arrows), and TH4 (black 
arrows). Their stratigraphic positions and associated chronologic data are provided in Fig. 2. 
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problem, and pollen samples often approach 
purity after processing (18-27). 

In this study, we collected large (>1 kg) sam- 
ples of bulk sediment from the exact same 
stratigraphic levels as those of the original sam- 
ples of Ruppia seeds from WHSA Locality 2, 
which yielded ages that range from 22.87 + 
0.30 to 21.13 + 0.25 ka (Fig. 2) (7). The sam- 
ples were subjected to rigorous chemical pre- 
treatment and microsieving to remove as much 
of the host material as possible, and the pollen 
grains were separated from the remaining 
residue by means of flow cytometry (22). All of 
the WHSA Locality 2 samples contain pollen 
assemblages that are indicative of a climate 
that was much cooler and wetter than today 
(22). Arboreal taxa include abundant Pinus 
(pine) and some Picea (spruce), Abies (fir), and 
Pseudotsuga (Douglas Fir), which indicate al- 
titudinal lowering of nearby conifer forests 
compared with present day and are consistent 
with previous findings for the Last Glacial Max- 
imum (23). Nonarboreal taxa are dominated 
by Artemisia, reflecting a sagebrush steppe 
that is not found in the region today (22). For 
dating, we targeted conifer pollen because it is 
relatively large (>70 um) and has a relatively 
thick exine, both of which result in increased 
carbon content. We measured the “C content 
of three different samples from WHSA Locality 
2 that each contained ~75,000 pollen grains, 
and the resulting calibrated ages range from 
23.4 + 2.5 ka to 22.6 + 2.3 ka (Fig. 2) (22). 

The uncertainties associated with the pollen 
ages are relatively large because they required 
blank corrections that are approximately an 
order of magnitude larger than what is typically 
used in radiocarbon dating. Blank corrections 
are performed on all radiocarbon samples to 
account for small amounts of contaminant “C 
(usually modern) that is introduced to samples 
during the various stages of processing, includ- 
ing collection, handling, chemical pretreatment, 
CO, extraction, graphitization, and the isotopic 
measurement (24-26). To isolate the pollen 
grains, the samples analyzed in this study re- 
quired extensive pretreatment procedures, so 
in order to properly correct the resulting ages, 
we also extracted and dated pollen from sedi- 
ments that are known to be beyond the limit 
of radiocarbon dating for use as a procedural 
blank (22). Assuming that the same amount 
of contamination was introduced to all of the 
samples analyzed in this study, which is equi- 
valent to an increase of ~2200 “C years or 
2500 calendar years for the WHSA Locality 2 
sequence (22), the resulting ages should be 
viewed as being accurate, albeit with reduced 
precision. Overall, the calibrated “C ages of the 
pollen samples from WHSA Locality 2 are sta- 
tistically indistinguishable from the original seed 
ages (Fig. 3), which supports the original chro- 
nologic framework developed by Bennett et al. 
(1). Last, even had we not undertaken this 
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Fig. 2. Geologic context of chronologic samples. (A) Composite stratigraphy, calibrated “C ages, and 
sample numbers (in italics) of pollen and Ruppia seeds, as well as the minimum OSL age for sediments 
exposed in the trench at WHSA Locality 2. The calibrated “C ages were used to construct the age models 
shown in Fig. 3. (B) Photograph looking south of the sediments exposed in the trench at WHSA Locality 2 
along with locations of radiocarbon samples for Ruppia seeds (1a, 1b, and 1d) and pollen (laa and 1dd) as 
well as the three OSL samples (OSL-1, -2, and -3) (22). Pollen sample Iff is located near the base of the 


trench to the east (fig. Sl). The large block cut out of t 
sampled previously for U-Th series disequilibrium dati 


pollen-specific approach for the blank correc- 
tion, the calibrated pollen ages would still have 
fallen within the Last Glacial Maximum (22). 
In addition to radiocarbon methods, which 
are based on radioactive decay, we used OSL 
dating, which is based on the accumulation of 
energy stored as trapped charge within the 
crystal structure of a mineral and is a function 
of background ionizing radiation (27). This 
trapped charge is released and the geochro- 
nometer is reset when a mineral, typically quartz, 
is exposed to sunlight or heat. For the sedi- 
ments at WHSA Locality 2, an OSL age is 
equivalent to the amount of time elapsed since 
the quartz was deposited and sealed from sun- 
light. Although the sediments at WHSA Locality 


he exposed face on the left side of the photograph was 
ng (1). 


a small amount of quartz, we were able to ob- 
tain three OSL age minima from a single strat- 
igraphic level located just below our lowest 
radiocarbon age but still well within the human 
footprint horizons. Collectively, they show the 
minimum OSL age of the sampled horizon is 
>21.5 + 1.9 ka (22). As with the pollen results, 
the OSL ages are statistically indistinguishable 
from the calibrated 'C ages of the lowest 
Ruppia seeds collected for dating from the 
stratigraphic sequence (Fig. 2). 

Both the calibrated “C ages of pollen and 
the OSL ages support the original ages obtained 
from the Ruppia seeds at WHSA Locality 2, 
but the accuracy of the age estimates from 
each method must be evaluated individually 


2 are dominated by gypsum and contain only 


before they can be accepted as reliable. The most 
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Fig. 3. Age dating and A 
modeling results. Modeled 

ages for (A) track horizons TH1 
through TH7 and (B) the start 
and end boundaries for the 

entire footprint horizon interval 
based on the calibrated “Cc 

ages of pollen and Ruppia seeds. 
Both figures were generated 
using the Sequence function 

in OxCal v.4.4.4 (32) and the 
IntCal20 calibration curve (23), 
which we accessed online 

30 May 2023. Samples identified 
as outliers at the 95% confidence 
level were excluded from the 
analyses (22). 


Probability 09 


common pitfall when dating pollen is that re- 
working of pollen from older sediments could 
potentially cause the ages to be too old (19, 28). 
To evaluate this possibility, we dated multiple 
aliquots of pollen extracted from sediments 
collected at the surface of an active playa in 
White Sands National Park. The pollen assem- 
blage in the modern playa sediments is con- 
sistent with the warm, dry conditions that 
prevail in the Tularosa Basin today and is com- 
posed primarily of nonarboreal taxa, including 
Amaranthaceae, Asteraceae, and Poaceae (all 
flowering plants) (22). The ““C content of the 
playa conifer pollen (22) is indistinguishable 
from current atmospheric “C levels (29), which 
shows that reworking of old pollen is inconse- 
quential in this part of the Tularosa Basin to- 
day. It follows that this also holds true for the 
late Pleistocene, especially considering that 
much of the basin was covered by Paleolake 
Otero at that time. 

Similarly, processes that take place after sed- 
iments are deposited can potentially influ- 
ence OSL ages by affecting the background 
radiation field, causing them to be either too old 
or too young. Such phenomena include changes 
in pore water content, enrichment or depletion 
of radiogenic isotopes over time, changes in the 
flux of cosmogenic radiation, incomplete re- 
moval of trapped charge during transport 
(partial bleaching), and physical disturbance 
of the grains from bioturbation. For the WHSA 
Locality 2 samples, the impact of these varia- 
bles is likely minimal based on the consistency 
of the measured pore water content between 
the low-permeability, clay-rich sediment sam- 
ples; the similarity in radiogenic isotope val- 
ues between samples; ample Sun exposure 
during transport prior to burial; and the intact 
stratigraphy at the sample locations. 

Resolution of the White Sands dating con- 
troversy requires that there is no combination 
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of physical or chemical processes that could 
cause the ages of three independent chrono- 
logic data sets to converge on a single age range 
while simultaneously being incorrect or biased. 
It also requires that paleoenvironmental indi- 
cators (pollen assemblages) be consistent with 
regional records for the period in question. These 
metrics are met in this study on the basis of the 
nature of the chronologic techniques we used 
and the materials to which they were applied. 
Combined with the previously reported geo- 
logic, hydrologic, stratigraphic, chronologic, and 
climatic evidence (/, 30, 31), the congruence of 
the calibrated “C ages of the aquatic Ruppia 
seeds, the calibrated “C ages of the terrestrial 
pollen grains, and the OSL ages support the 
conclusion that humans were present in North 
America during the Last Glacial Maximum. 
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Hormone-mediated neural remodeling orchestrates 
parenting onset during pregnancy 


Rachida Ammari'+, Francesco Monaca’}, Mingran Cao’, Estelle Nassar’, Patty Wai?, 
Nicholas A. Del Grosso’, Matthew Lee’, Neven Borak!, Deborah Schneider-Luftman’, Johannes Kohl** 


During pregnancy, physiological adaptations prepare the female body for the challenges of motherhood. 
Becoming a parent also requires behavioral adaptations. Such adaptations can occur as early as 
during pregnancy, but how pregnancy hormones remodel parenting circuits to instruct preparatory 
behavioral changes remains unknown. We found that action of estradiol and progesterone on galanin 
(Gal)-expressing neurons in the mouse medial preoptic area (MPOA) is critical for pregnancy-induced 
parental behavior. Whereas estradiol silences MPOA®' neurons and paradoxically increases their 
excitability, progesterone permanently rewires this circuit node by promoting dendritic spine formation 
and recruitment of excitatory synaptic inputs. This MPOA“"-specific neural remodeling sparsens 
population activity in vivo and results in persistently stronger, more selective responses to pup stimuli. 
Pregnancy hormones thus remodel parenting circuits in anticipation of future behavioral need. 


otherhood is associated with pro- 

nounced behavioral changes in many 

species, such as altered feeding rou- 

tines and increased aggressivity (J-9). 

These adaptations are typically attri- 
buted to the hormonal changes associated 
with giving birth (parturition), which have been 
hypothesized to activate or prime parental cir- 
cuits (10, 11). One of the most notable differences 
between sexually inexperienced (virgin) females 
and mothers is their infant-directed behaviors: 
Whereas virgins typically avoid infants or exhibit 
low levels of parental behavior, mothers are 
highly parental (12-14). Classical studies in 
rats have found increased maternal responsive- 
ness during pregnancy (9, 15, 16). This even 
occurs in females undergoing a caesarean sec- 
tion during mid- or late pregnancy and per- 
sists for weeks (15, 17-20). Correspondingly, 
parental behavior can be elicited in virgin rats 
by mimicking the hormonal changes of preg- 
nancy (21-27), which include drastic rises in 
the levels of estradiol (E2) and progesterone 
(P4) (fig. SIA). These observations indicate 
hormone-mediated, preparatory neural adap- 
tations to infant-directed behavior during preg- 
nancy. However, despite the identification of 
numerous forms of pregnancy-associated neu- 
ral plasticity (1, 28, 29), it remains unknown 
how pregnancy hormones affect parenting 
circuits to mediate changes in infant-directed 
behavior. 


Hormone-dependent, long-lasting changes to 
pup-directed behavior in pregnancy 


Whereas virgin female rats and wild house 
mice typically ignore or attack pups, respectively, 
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virgin female laboratory mice often exhibit 
hormone-independent, spontaneous parental 
behavior (9). We first investigated when and 
how pup interactions change during pregnancy. 
We exposed female mice to pups at regular 
intervals before, during, and after pregnancy 
and scored their behaviors (Fig. 1A, Preg). Most 
aspects of parental behavior were affected by 
pregnancy (Fig. 1, D to G, and fig. S1, B to J), 
and this was particularly pronounced in late 
pregnant females [day 18 (D18)]: All D18 females 
retrieved pups with short latency (D18, 39.7 + 
10.8 s; virgins, 477.9 + 143.3 s; Fig. 1, D to G), 
crouched above pups (17.3 + 3.5% of assay 
duration), and spent most of their time in the 
nest (fig. S1, B and C). In addition to individual 
aspects of parenting, pregnancy affected behav- 
ioral sequences: Whereas D18 females per- 
formed sequences of retrieval, crouching, nest 
building, and grooming, virgins engaged in 
repetitive sniffing-grooming-nest entering epi- 
sodes (Fig. 1H). The increased parental per- 
formance of D18 females could be a result of 
hormonal effects and/or frequent pup expo- 
sure (30). We therefore assessed pup interac- 
tions in females that were exposed to pups 
only as virgins and at D18 (Fig. 1B, Dwal) and 
in ovariectomized females (Fig. 1C, OVX) (37). 
Pup retrieval, crouching, and time in nest dif- 
fered between virgins (Vir) and D18 females in 
the Preg and Dual groups. By contrast, such 
differences were not present in the OVX group 
over similar time points and are thus primarily 
affected by pregnancy hormones (Fig. 1, D to G, 
and fig. S1, B to E). These behaviors were also 
up-regulated in females that were exposed to 
pups only once, at D18, which illustrates that 
the pregnancy-induced onset of parenting does 
not require any previous pup exposure (fig. S1, 
Nto S). The hormonal milieu of pregnancy thus 
leads to an onset of specific parental behaviors 
in mice, and these behavioral changes are maxi- 
mal in late pregnancy. Most behavioral changes 
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am 
persisted until at least 1 month after partur: ore 
(D50) (Fig. 1, D and F, and fig. SID), wo. 
hormone levels have returned to baseline (fig. 
S1A). These adaptations thus likely result from 
long-lasting remodeling of the brain by preg- 


nancy hormones. 


Hormone action on MPOA“! neurons is 
critical for parenting onset 


Parenting is controlled by brain-wide circuits 
(32-36), several elements of which might be 
affected by hormones. In particular, the medial 
preoptic area (MPOA)—which is critical for 
parental behavior—has been shown to be a 
hormonal target (37-39). Parental behavior can 
be induced in virgin female rats by administra- 
tion of the hormones E2, P4, prolactin, and 
oxytocin (21-26, 40, 41), and global knockout 
(KO) of their canonical receptors impairs 
parenting (42-45). Because combined systemic 
administration of E2 and P4 is most effective 
in triggering parenting onset (72), the under- 
lying neural substrates are likely sensitive to 
both hormones. E2 and P4 can permanently 
modulate neuronal function through intra- 
cellular receptors that act as transcription 
factors (46-49). We focused on the intra- 
cellular estrogen receptor 1 (Esr1) and pro- 
gesterone receptor (PR) because they are 
critical for parental behavior (43, 50-52) and 
because the long-lasting nature of pregnancy- 
induced behavioral changes implicates gene 
expression-dependent forms of plasticity. 
Using single-molecule fluorescence in situ 
hybridization (smFISH) from hypothalamic 
brain sections of virgins and D18 females, we 
found that Esr1-PR coexpressing neurons (as 
well as neurons expressing either receptor) were 
most enriched in MPOA subregions (fig. S2, A 
to F). Prolactin receptor, but not oxytocin re- 
ceptor, expression was similarly enriched in 
the MPOA (fig. S2, G and H). 

To determine whether Esr1 or PR expression 
in the MPOA is required for parenting onset 
during pregnancy, mice carrying floxed re- 
ceptor alleles (materials and methods) (53) 
were injected into the MPOA with an adeno- 
associated virus (AAV)-expressing Cre recom- 
binase (Fig. 11. This resulted in local receptor 
KO, whereas injection of a green fluorescent 
protein (GFP)-expressing control AAV did not 
affect receptor expression (fig. $3, A to H). 
MPOA-specific ablation of either Esr1 or PR had 
no effect on pup interactions in virgins but 
completely blocked the pregnancy-induced up- 
regulation of pup retrieval, crouching, and nest 
time at D18 (Fig. 1, J to L, and fig. S4, A to C). By 
contrast, parental behaviors were normally up- 
regulated at D18 in animals injected with con- 
trol AAVs (Fig. 1, J to L, and fig. $4, A to D). 

Several overlapping populations of MPOA 
neurons are involved in the control of parent- 
ing, with galanin-expressing (MPOA“) neurons 
being critical for this behavior (33-35, 54-56). 
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Fig. 1. Hormone action on MPOA“' neurons is critical for pregnancy-induced 
onset of parental behavior. (A to C) Testing pup-directed behavior in repeatedly 
pup-exposed pregnant females (Preg, n = 10) (A), pregnant females exposed to 
pups twice (Dual, n = 9) (B), and repeatedly pup-exposed ovariectomized females 
(OVX, n = 10) (C). Day of pregnancy [(A) and (B)] or relative to pairing with male 
(C) are shown. (D and F) Parental behaviors in Preg group. Within-group [Preg, virgin 
(Vir) versus each subsequent time point; red asterisks] and between-group (Preg 
versus OVX; black asterisks) comparisons are shown. (E and G) Comparison of Vir and 
D18 time points across groups. Note that virgins from Preg and Dual groups are pooled 
(fig. SIM). (H) Behavioral state transition diagrams for Vir and D18 females (Preg, 


MPOAS! neurons, most of which express Esr1 
and PR (fig. S3, J and K), constitute ~20% of 
MPOA neurons (35). To determine whether 
hormonal sensitivity of this subpopulation is 
necessary for pregnancy-induced behavioral 
adaptations, we made a knock-in mouse line 
expressing Flp recombinase in Gal neurons 
(fig. S3I and materials and methods) and 
crossed this allele into mice with floxed recep- 
tor genes. AAV-mediated ablation of either 
Esr1 or PR in MPOAS! neurons fully re- 
capitulated the effects observed after MPOA- 
wide receptor KO (Fig. 1, M to P, and fig. S4, 
E to H). By contrast, pup contact latency—a 
parameter not modulated by pregnancy—was 
not affected by this manipulation (fig. S41). 
The parental behaviors of these receptor- 
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ablated animals remained impaired even af- 
ter giving birth (D22; fig. S5), which indicates 
that the lack of hormone-mediated behavioral 
preparations during pregnancy cannot be com- 
pensated for by the subsequent endocrine events 
of parturition. Direct action of E2 and P4 on 
MPOA“ neurons through their intracellular 
hormone receptors is therefore necessary 
for pregnancy-mediated increases in parental 
behavior. 


Long-lasting hormonal remodeling of 
MPOA“"' neurons during pregnancy 


We next investigated how pregnancy affects 
MPOA“" neurons and performed patch clamp 
recordings in brain slices from virgins and D18 
females (Fig. 2A, top). Recorded neurons were 
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n= 10). Average transition probabilities (P7) between behaviors are shown, and 
differences between Vir and D18 females are highlighted if P < 0.05 (U test; see 
materials and methods). retr., retrieval. (I) AAV-mediated ablation (KO) of Esrl or PR in 
MPOA and control (ctrl). (J to L) Effects of MPOA-wide KO of Esr1 or PR on pup- 
directed behaviors (n =7, 8, and 9 mice). (M) KO of Esrl or PR in MPOAS*' neurons. 
(N to P) Effects of MPOA*'-specific KO of Esr1 or PR on pup-directed behaviors 
(n = 8, 5, and 13 mice). Statistics by Kaplan-Meier survival analysis with log rank 
test in (D), (E), (J), and (N) and by Fisher's exact test with Benjamini-Hochberg 
adjustment for multiple comparisons in (F), (G), (K), (L), (0), and (P). Shaded area 
in (D) is SEM. ***P < 0.001; **P < 0.01; *P < 0.05. 


neurobiotin-filled and reconstructed to assess 
morphological changes. MPOA“ neurons ex- 
hibited lower baseline firing and resting mem- 
brane potential in late pregnancy (Fig. 2, B and 
C, and supplementary text), with a significantly 
higher proportion of silent neurons at D18 (fig. 
S6, A and B). This silencing was abolished by 
Tertiapin-Q (TQ) and might thus be mediated 
by GIRK channels (Fig. 2C). At the same time, 
MPOA“ neurons were more excitable at D18 
and less frequently exhibited depolarization 
block (Fig. 2, D and E, and fig. S6K). We ob- 
served a reduction in action potential half-width 
at D18 (fig. S6E), which hints at increased 
function of delayed rectifier K* channels that 
repolarize neurons to permit sustained firing 
(57). These effects on neuronal membrane 
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Fig. 2. Hormonal remodeling of MPOA*! neurons. (A) Whole-cell recordings from wild-type (top) and 
receptor-deleted (KO) (bottom) MPOA' neurons. (B) Cumulative distribution of baseline firing frequency 
(Vir and D18; 33 and 21 cells from n = 15 and 7 mice). (C) Resting membrane potential of control and 
receptor-deleted MPOA®*! neurons and recordings in presence of GIRK channel blocker Tertiapin-Q (Tert-Q) 
(38, 32, 15, 18, and 26 cells from n = 15, 9, 3, 3, and 5 mice). (D) Example current clamp recording traces 
of cells with (Vir) and without (D18) depolarization block. (E) Percentage of neurons exhibiting depolarization 
block (34, 30, 18, and 25 cells from n = 15, 8, 3, and 5 mice). (F) Example voltage clamp recording traces with 
sPSCs. (G) sPSC frequency (21, 23, 18, and 26 cells from n = 9, 6, 3, and 5 mice). (H and I) Excitatory 
postsynaptic current (EPSC) (Vir and D18; 31 and 30 cells from n = 5 and 4 mice) (H) and inhibitory 


postsynaptic current (IPSC) (31 and 28 cells from n = 5 and 4 mice) (I) frequency. 
segments of MPOA“*' neurons with spines. (K) Spine density (14, 10, 8, and 15 cells from n = 


(J) Dendri 


tic 
10, 4, 3, 


and 4 mice). (L) Summary scheme for hormonal remodeling of MPOA°*' neurons. Statsitics by U test in 


(B); one-way analysis of variance (ANOVA) with Dunnett's post hoc test in (C), (G 


, and (K); 


Fisher's 


exact test with Benjamini-Hochberg adjustment in (E); and K-S test in (H) and (I). Scale bars, 20 wm 
(A) and 10 um (J). ***P < 0.001; **P < 0.01; *P < 0.05. 


properties were already apparent in midpreg- 
nancy (D10; fig. S6, B to E) and were linked 
because 80% of silent MPOA% neurons also 
did not exhibit depolarization block at D18 
(fig. S60). MPOA®! silencing was not a re- 
sult of increased inhibitory synaptic inputs: 
Although these neurons received more spon- 
taneous postsynaptic currents (sPSCs) at D18, 
this was because of an increase in excitatory 
inputs (Fig. 2, F to I, and fig. S6M), which pre- 
dominantly targeted spontaneously active— 
i.e., nonsilenced—neurons (fig. S6N). Corre- 
spondingly, MPOA“ neurons had more den- 
dritic spines at D18 (Fig. 2, J and K). This 
remodeling of synaptic inputs was also already 
detectable at D10 (fig. S6, I and J). We did not 
observe changes to sPSC amplitude and dy- 
namics (fig. S6L) or to dendritic complexity and 
somatic volume (fig. S6, P and Q) (58). Preg- 
nancy therefore reduces the baseline activity 
of MPOA“ neurons while increasing their ex- 
citability and promoting the recruitment of 
excitatory synaptic inputs. Pregnancy did not 
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have equivalent effects on Gal-negative MPOA 
neurons, which highlights the specificity of 
MPOA@ neuronal remodeling (fig. $7). 

To address whether these biophysical and 
morphological changes were the result of direct 
hormonal action, we recorded from MpoAc 
neurons in which Esr1 or PR were deleted (Fig. 
2A, bottom). Ablation of these receptors re- 
turned specific, nonoverlapping aspects of D18 
neuronal physiology to a virgin-like state: Esr1 
deletion specifically prevented pregnancy- 
induced silencing and changes to excitability 
(Fig. 2, C and E) but did not affect synaptic 
inputs and spine density (Fig. 2, F, G, J, and K). 
By contrast, PR deletion selectively abolished 
the up-regulation of synaptic inputs and spine 
density (Fig. 2, G and K) without affecting 
membrane properties (Fig. 2, C and E). Trans- 
duction with a control AAV had no effect (fig. 
S6, R to W). E2 and P4 therefore control dis- 
crete aspects of pregnancy-induced plasticity 
in MPOA“ neurons: Whereas E2 tunes mem- 
brane potential and intrinsic excitability, P4 
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mediates the recruitment of additional excit- 
atory synaptic inputs (Fig. 2L). To assess how 
long-lasting these changes were, we recorded 
from MPOA“" neurons in mothers shortly after 
parturition (D22) and at D50, when pregnancy- 
and parturition-associated hormone levels have 
returned to baseline (fig. S8A). MPOA@ rest- 
ing membrane potential and firing frequency 
remained reduced at D22 and only returned 
to virgin-like levels at D50 (fig. S8, B and C), 
whereas neuronal excitability reverted imme- 
diately after parturition (fig. S8, E and F). By 
contrast, synaptic inputs and spine density 
showed a long-lasting up-regulation (fig. S8, 
G and H). Similar to the lasting behavioral 
effects of receptor ablation, its physiological 
effects persisted in mothers (fig. S8, I to 
K). These observations suggest that preg- 
nancy hormones permanently alter the circuit 
integration of Mpoa! neurons, thereby pro- 
viding a cellular substrate for the long-lasting 
behavioral effects of pregnancy. 


Reorganization of MPOAS' neuronal and 
neural population activity during pregnancy 


We next investigated the effects of pregnancy 
on MPOA® neural activity in vivo. We per- 
formed longitudinal, cellular-resolution calcium 
imaging from MPOA“ neurons in females ex- 
posed to pups and a set of social and nonsocial 
stimuli (Fig. 3, A to C, and fig. S9A) (59). Con- 
sistent with the silencing observed in our slice 
physiology recordings, the number of detecta- 
ble (nonsilent) MPOA“ neurons was signif- 
icantly reduced at D18 in vivo (Fig. 3, D and E). 
This reduction was not the result of a decline 
in the number of GCaMP-expressing Mpoac! 
neurons over time or increased calcium buf- 
fering by rising GCaMP levels because it was 
reversible (Fig. 3E) and did not occur in virgin 
females recorded at identical time points (fig. 
S9B). It also did not result from potential shifts 
in the recording plane because we observed 
this effect when imaging ex vivo (fig. S9, E 
and F). Finally, the number of detected neurons 
was not significantly decreased in MPOA-wide 
recordings (fig. S9, C and D). Pregnancy-induced 
silencing therefore preferentially occurs in 
Mpoac! neurons, consistent with our electro- 
physiological findings in brain slices (fig. S7, 
A to E). 

The fraction of neurons activated during pup 
retrieval and pup grooming decreased at D18 
(Fig. 3, F and G, and fig. S1OA). By contrast, 
similar fractions of MPOA“ neurons were 
active during pup sniffing in virgins and at 
D18, but their responses occurred with shorter 
latency at D18 and D50 (Fig. 3, I to K), which 
indicates a higher excitability of MPOA“ neu- 
rons to pup stimuli during and after preg- 
nancy. Pregnancy therefore sparsens MPOoA! 
population activity during parental actions 
and makes these neurons more excitable to 
pup stimuli. The baseline activity of individual 
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Fig. 3. Reorganization of MPOA“' population activity during pregnancy. 
(A) Recording setup for miniature microscope recordings. (B) Gal-Cre animals 
were injected into the MPOA with AAV-FLEx-GCaMP7s and implanted with a 
P7s expression and GRIN lens position are shown. (C) Experimental 
design (see materials and methods). (D) Sample recording frames with detected 
neurons and example activity traces from a virgin. (E) Number of detected (nonsilent) 
neurons per animal (n = 5 mice). (F and I) Temporal profile of MPOA°*' responses 
during pup retrieval (F) or sniffing (1) in virgins, at D18, and at D50 (162, 77, and 

93 neurons from n = 5 mice). Dashed lines indicate action onset. Order was 
chical clustering sorted by mean cluster response onset. (G and 
J) Fraction of neurons with positive evoked response during pup retrieval (G) 

H and K) Averaged Z score for neurons activated 


GRIN lens. GCa 


based on hiera 


or sniffing (J) (n = 5 mice). 
during pup retrieval in virgins, at D18, and at D50 (115, 


5, 5, and 4 mice) (H) or sniffing (122, 51, and 86 neurons from 5, 5, and 4 mice) 
(K). Statistics by two-way ANOVA with Tukey post hoc test, and gray bars indicate 


periods of significant difference for Vir versus D18 and Vir 


MPOA“ neurons was negatively correlated 
with their tuning to pup stimuli at D18, thereby 
linking neuronal silencing to stronger pup- 
evoked responses (Fig. 3L). To address pregnancy- 
induced differences in how MPOA@ neurons 
represent pup stimuli, we examined their ac- 
tivity patterns during chemoinvestigation of 
pups and other stimuli (Fig. 3C). MPOAG! 
neuronal stimulus selectivity for, and response 
strength to, pups increased in late pregnancy 
(Fig. 3M and fig. S11, A to E). Similarly, al- 
though linear discriminant analysis (LDA) 
could not separate pup representations in 
MPOA@! neurons well from those of other 
stimuli in virgins, separability of pup repre- 


sentations from those of other stimuli was en- 
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41, and 63 neurons from 


versus D50. (L) Correlation 


hanced at D18 (Fig. 3, N and O). Stimulus 
separability was positively correlated with 
population sparsening, thereby linking improved 
pup representations with effective encoding of 
parental actions (Fig. 3P). At D50, the numbers 
of spontaneously active and retrieval-activated 
neurons had largely returned to virgin levels 
(Fig. 3, E and G), mirroring our findings in 
brain slices (fig. S8, B to D). By contrast, pup 
stimulus selectivity and separability showed 
a long-lasting increase (Fig. 3, M to O). These 
findings demonstrate that pregnancy leads to 
a pronounced sparsening of spontaneous and 
parenting-associated activity in MPOA“ neu- 
rons and to increased selectivity for infant 
stimuli. 
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between normalized tuning index for responses to pup sniffing and normalized mean 
baseline activities at D18 (coefficient of determination r* = 0.202: P < 0.001). 
(M) Selectivity of chemoinvestigation-associated responses for indicated stimulus 
pairs at Vir, D18, and D50 (142, 35, and 108 cells from n = 4, 3, and 4 mice) compared 
with pups. A selectivity score of 1 means the neuron is only activated during pup 
sniffing, a score of O means selective activation during sniffing of other stimulus, 
and a score of 0.5 equals a nonselective response (see materials and methods). 

(N) Example MPOAS*' neuronal activity at Vir and D18 during object investigation 
in LDA space (int, intruder; obj, object). Temporal bins were 
Ellipsoids represent 95% confidence area of neuronal activity to each stimulus. 
(O) Separability of indicated stimulus combinations by the 
[Rand Index (RI), n = 4, 3, and 4 mice]. (P) Correlation between separability 
(pup versus intruder) and activated fraction of neurons during pup retrieval (r° = 
0.56; P < 0.001). Statistics by paired t tests in (E), (G), and (J); mixed linear 
model with mouse ID as group in (M); linear regression in (L) and (P); and unpaired 
t tests in (0). Scale bar in (B), 500 um. ***P < 0.001; **P < 0.01; *P < 0.05. 


used as features. 


POA" population 


Discussion 

Considerable progress has been made in un- 
covering the functional architecture of parent- 
ing circuits (9, 32-36), but little is known about 
how hormones alter these circuits to ensure 
state-dependent behavioral flexibility. We dis- 
covered that pregnancy hormone action on 
MPOA“ neurons—a hub in parenting circuits— 
is critical to instruct a preparatory change in 
infant-directed behavior. The ovarian hormones 
E2 and P4 each control distinct aspects of 
pregnancy-induced neural remodeling: Whereas 
E2 transiently silences MPOA“ neurons and 
increases their excitability, P4 permanently re- 
models this circuit element by recruiting synap- 
tic inputs. This results in sparsened population 
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activity during parental behavior and in poten- 
tiated, more-selective responses to pup stimuli. 
We propose that the resulting increase in 
signal-to-noise ratio, both in individual neu- 
rons and at the population level, enables more 
efficient encoding of parental motor actions 
by MPOA“ neurons. Population sparsening 
through silencing might contribute to setting 
up the circuit for efficient parental behavior by 
selectively recruiting inputs onto active MpPoAc 
neurons during pregnancy. Once rewired, this 
circuit could then drive robust parenting in 
response to pup cues, whereas release from 
silencing during the postpartum period might 
allow for recruitment of these neurons during 
nonparental social interactions. 

The long-lasting, P4-mediated remodeling 
of MPOAS synaptic inputs provides a cellular 
correlate for the long-lasting behavioral changes 
that we observe. Although parturition-associated 
hormonal changes and subsequent maternal 
experience cannot compensate for lack of hor- 
monal remodeling during pregnancy, these 
factors might normally contribute to long-term 
enhancement of maternal behavior (maternal 
memory) (9). Repeated and/or prolonged co- 
housing of virgins with pups results in elevated 
levels of parenting through sensitization (60, 61), 
and parental care can be socially transmitted 
by mothers (62). It is unclear whether these 
paradigms result in similar neuronal changes. 
Ablating MPOA@" neurons or making them 
hormone-insensitive both abolish pup retrieval, 
but optogenetic activation of these neurons 
elicits pup grooming in virgins (35). Although 
it remains unknown which neuronal ensembles 
are recruited by artificial, acute stimulation, 
they seem to differ from the sparse populations 
that drive robust parenting in late pregnancy. 

E2 silences MPOAS neurons beyond par- 
turition, presumably by up-regulating GIRK 
channel expression, whereas the more tran- 
sient increases in excitability are likely a result 
of potentiated function of delayed rectifier 
K* channels (57). The identity of the addition- 
al excitatory inputs recruited by P4 remains 
unknown. They might constitute long-range 
afferents conveying pup sensory informa- 
tion because most local MPOA neurons are 
GABAergic [y-aminobutyric acid (GABA)] (54). 
Future work will characterize the identity and 
functional role of the cellular pathways tar- 
geted by Esrl and PR. Coexpression of these 
receptors is not unique to Gal-expressing MPOA 
neurons (fig. S2C). We hypothesize that permis- 
sive chromatin states in these neurons allow 
for cell type-specific, hormonally induced tar- 
get gene expression. MPOA“! neurons form 
molecularly distinct subpopulations (54) that 
might be differentially affected by pregnancy 
hormones. We also expect MPOA“ and other 
neurons in parenting circuits to be sensitive to 
additional pregnancy hormones, such as pro- 
lactin, placental lactogens, allopregnanolone, 
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and oxytocin. Prolactin for instance, which 
acts on the MPOA in early pregnancy to re- 
duce physical activity (63), might also contrib- 
ute to early changes in pup-directed behavior 
(Fig. 1, D and F). 

Unlike laboratory mice, the majority of wild 
virgin female mice exhibit infanticide (64). Our 
work provides mechanisms through which hor- 
mones might act in parental circuits of wild 
mice and other species that critically depend 
on endocrine changes for the onset of short- 
latency maternal behavior, such as rats, rab- 
bits, and sheep (9). The neural activity changes 
observed in this work—i.e., population spars- 
ening and increased stimulus selectivity and 
discriminability—are reminiscent of changes 
that occur during critical periods in the devel- 
oping brain (65). Our work therefore suggests 
that pregnancy hormones open a window of 
adult plasticity during which neural remod- 
eling orchestrates behavioral adaptations for 
the future challenges of motherhood. 
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A quantum ruler for orbital magnetism in 


moiré quantum matter 


M. R. Slot’?+, Y. Maximenko"+, P. M. Haney’, S. Kim™, D. T. Walkup’, E. Strelcov"’, Son T. Le’, 
E. M. Shih?, D. Yildiz*4, S. R. Blankenship’, K. Watanabe®, T. Taniguchi®, Y. Barlas’, N. B. Zhitenev’, 


F. Ghahari®*, J. A. Stroscio™ 


For almost a century, magnetic oscillations have been a powerful “quantum ruler” for measuring 
Fermi surface topology. In this study, we used Landau-level spectroscopy to unravel the energy-resolved 
valley-contrasting orbital magnetism and large orbital magnetic susceptibility that contribute to the 
energies of Landau levels of twisted double-bilayer graphene. These orbital magnetism effects led to 
substantial deviations from the standard Onsager relation, which manifested as a breakdown in scaling 
of Landau-level orbits. These substantial magnetic responses emerged from the nontrivial quantum 
geometry of the electronic structure and the large length scale of the moiré lattice potential. Going beyond 
traditional measurements, Landau-level spectroscopy performed with a scanning tunneling microscope 
offers a complete quantum ruler that resolves the full energy dependence of orbital magnetic 


properties in moiré quantum matter. 


oiré quantum matter (MQM) systems 

(2) consist of stacked and twisted layers 

of van der Waals materials and have 

emerged as versatile condensed-matter 

quantum simulators (2). The twist 
angle, choice of material, number of layers, and 
the application of electric and magnetic fields 
provide a vast arena for realizing quantum 
phases that result from the interplay between 
electron correlations and topology. The dis- 
covery of flat electronic bands that host super- 
conductivity and correlated insulating states in 
magic-angle twisted bilayer graphene (MATBG) 
inspired rapid exploration of the parameter 
space (3-5). The most recent endeavors have 
focused on heterostructures of three, four, and 
five layers of alternating twisted graphene 
monolayers (6-11). 

The moiré systems of MATBG and related 
heterostructures support topological bands with 
nonzero Chern number (4, 12), which is de- 
rived from the Berry curvature of the Bloch 
wave functions (73). Berry curvature is intimately 
related to orbital magnetization, and indeed, 
previous work on MATBG and related systems 
has observed orbital magnetic order, valley Hall, 
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and quantum anomalous Hall effects (14-16). 
Berry curvature and the related quantum met- 
ric also provide significant contributions to the 
orbital magnetic susceptibility (17, 18). The 
valley-contrasting orbital magnetism and or- 
bital magnetic susceptibility play a central role 
in the material response to applied magnetic 
fields. Landau-level (LL) spectroscopy is a well- 
established tool to experimentally deduce the 
zero-field properties of the band structure, 
and in this work, we expand its application 
to extracting higher-order magnetic response 
functions. 

Semiclassically, LLs are described by the 
Onsager relation, relating the extremal cross- 
sectional area of the Fermi surface to the pe- 
riod of oscillations in the de Haas-van Alphen 
effect (19). The orbital magnetism and orbital 
magnetic susceptibility give rise to energy- 
dependent first- and second-order corrections 
with magnetic field in the Onsager relation 
(20), which was recently formulated as a 
general expansion in higher-order response 
functions (21). To determine these effects and 
their full energy dependence, high-resolution 
measurements of the LLs in the electronic 
bands are required: a “quantum ruler” prob- 
ing the evolution of LLs at all energies as a 
function of displacement field and magnetic 
field. 

In this work, we used LL measurements as a 
quantum ruler for twisted double-bilayer graph- 
ene (TDBG) to quantitatively determine the 
tunable electronic structure, energy-dependent 
valley-contrasting orbital magnetism, and or- 
bital magnetic susceptibility. TDBG is MQM 
with highly reconfigurable bands; the Bernal 
bilayer constituents have an electrostatically 
tunable band structure in themselves (22-37). 
Flat bands exist over a wide range of small 
twist angles (8), from about 0.8° to 1.5° Pre- 
vious transport and scanning tunneling micros- 


am 


copy (STM) measurements have focusec ore 
correlated states, which form at various pal. 
fillings (22-30), or density wave states at larger 
twist angles of 0 ~ 2.4° (37). Our intermediate 
twist angle of 1.74° gives rise to moiré mini- 
bands that are narrow but not extremely flat. 
This slightly wider bandwidth allows LLs to be 
resolved over the entire band structure, which 
has not been observed in previous local probe 
measurements (29). 


Tuning the electronic structure of TDBG 


Figure 1A shows the simulated band structure 
of TDBG at 0 = 1.75° with the Bistritzer- 
MacDonald (BM) continuum model (3). The 
moiré periodic potential leads to narrow 
moiré minibands, composed of two low-energy 
bands—the valence V1 and conduction Cl 
bands—and higher remote energy bands—V2 
and C2. The narrow bands V1 and Cl have a 
bandwidth of about 50 meV and are isolated 
from other bands (Fig. 1B). The TDBG system 
displays a large tunability in its electronic 
structure (12, 32-39). A perpendicular electric 
field induces a potential difference between 
the layers of TDBG, drastically altering the V1 
band structure. The V1 band is relatively flat at 
zero displacement field (D) (Fig. 1B) and de- 
velops pronounced electron and hole pockets 
at D = O (Fig. 1C and fig. S8). In the next 
section, we first describe the measurement 
of this band structure evolution under a dis- 
placement field. 

Measurements were performed in a custom- 
built dilution refrigerator-based scanning probe 
system operating at 10 mK (40, 41). Figure 1D 
shows the STM topograph of the TDBG sam- 
ple used in this study (fig. S1). The measured 
moiré periodicity of 4 = (8.1 + 0.1) nm cor- 
responds to a twist angle of 0 = (1.74° + 0.02°), 
and the local heterostrain was found to be 
<0.1%, as determined from atomically resolved 
spatial measurements (42). Scanning tunnel- 
ing spectroscopy (STS) measures the differen- 
tial tunneling conductance (dJ/dV) signal, which 
is proportional to the local density of states 
(LDOS) (43). For TDBG, the measured d//dV 
reflects the LDOS of the topmost layer of the 
four-layer system. The finite tip potential 
leads to a fixed top gate V;, determined by the 
contact-potential difference between probe 
and sample. Varying the back-gate voltage Vq 
simultaneously tunes the displacement field 
D and the carrier density n following D 
CaVe = CrVr and n CaVa + CrVz, where 
Cq and Vg are the back-gate capacitance and 
voltage, and Cy and Vy are tip capacitance 
and potential, respectively (42). We mapped 
the differential tunneling conductance as a 
function of sample bias Vz, the displacement 
field D, carrier density n, and magnetic field 
B. Figure 1E shows the experimental LDOS 
map at B = 0 T. The low-energy bands V1 and 
Cl and remote bands V2 and C2 and their 
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U =O meV 


E (meV) 


field at 6 = 1.75°. (A) Energy-band structure of TDBG showing the four 
low-energy bands computed by using a continuum model with interlayer 
potential U = 0 meV and @ = 1.75° Line cuts from the energy bands in (A) along 
Bloch wave vector k, = O for (B) U = 0 meV and (C) U = 80 meV. A dashed 
horizontal line indicates the position of the Fermi level (Er), which moves through 
the bands as the displacement field and carrier density are varied. The x-axis 
labels (g, k', m, and k) denote high-symmetry points in the mini-Brillouin 
zone. (D) STM topographic image of the TDBG moiré pattern, with moiré 


evolution with displacement field are clearly 
visible. The experimental data qualitatively 
agree with the modeled LDOS of the top layer 
of the four-layer graphene system obtained 
with continuum-model calculations [Fig. 1F 
(42)]. The calculations in this work are pre- 
sented for qualitative comparison and are 
not meant for quantitative comparison with 
the experiment. 

The trends in the d//dV map can be quali- 
tatively understood by considering how the 
theoretically predicted bands in Fig. 1, B and C, 
fill with n as the Fermi level is raised from 
negative to positive energies while the bands 
are modified by the displacement field D. At 
negative displacement field approaching D = 0, 
the Fermi level is initially pinned to the flat 
portions of V1 with its associated high LDOS 
(Fig. 1B). This appears in the spectral maps 
(Fig. 1, E and F) with the high LDOS of the V1 
band located near the Fermi level, correspond- 
ing to zero sample bias. Above a displacement 
field of 0.2 V/nm, the relatively flat C1 band 
becomes pinned at the Fermi level, and the V1 
band develops a larger dispersion while its 
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108.8 


Experimental 
di/dVv (nS) 
0 2 4 6 


saddle point 


highest LDOS at its saddle point moves to 
lower energies away from C1 with increasing 
positive displacement field (Fig. 1C). This re- 
sults in a bright spectral line associated with 
the saddle point in the V1 band with a negative 
slope in the top portion of the spectral maps in 
Fig. 1, E and F. 


Mapping Landau levels 


Application of a magnetic field creates sharp 
LLs in the d//dV spectrum of all four bands, as 
shown in the d//dV maps at B = 4T and 8 Tin 
Fig. 2, B and D, respectively (see figs. S11 to S13 
for additional data sets). All dJ/dV maps with 
magnetic field are acquired on an ABBC site 
(Fig. 1D, red dot). The experimental maps can 
be compared with the theoretical maps in Fig. 2, 
Cand E, obtained from magnetic field-dependent 
quantum mechanical calculations (42), follow- 
ing the methods in (44). A series of LL spectra 
at different displacement fields are extracted 
from the experimental map at B = 4: T (Fig. 2A). 
The LLs in the V1 band are clearly pronounced 
and show that the LL energies are irregularly 
spaced, showing neither the equal spacing char- 


U =80 meV 
Cc 150 F 
100 ee a a 
hole pocket 2 
on ns ee 8 C, 
2 0 [" ae ~ ~ “E, 
i Vv 
50K , 
<> electron pocket 
100 + Vv, 
0b, ; ; saddle point 
g ki mk g 
F Theory 
B=0T LDOS (arb. units) 


wavelength 2 = (8.1 + 0.1) nm. (Inset) A magnified portion of the image 
showing the atomic lattice of graphene. A twist angle of @ = (1.74° + 0.02°) was 
determined from the atomically resolved moiré lattice (42). The red dot indicates 
the ABBC symmetry position where the spectral maps with applied magnetic 
field were acquired. Tunneling setpoint | = 20 pA; Vg = 100 mV; temperature T = 
0.01 K. (E) Tunneling spectral maps in the D versus Vg plane for B = 0 T. No (=No) 
indicates the electron density at full filling of the C1 (V1) band. Setpoints: 

| = 20 pA, Vg = 120 mV, T = 0.01 K. (F) Corresponding theory map for B = 0 T 
(continuum-model calculation). arb. units, arbitrary units. 


acteristic for a parabolic band nor the square 
root dependence on orbital index as observed 
in graphene. In the next section, we use the 
evolution of the V1 LL spacing to experimentally 
demonstrate the characteristics of the tunable 
band. 

The electron-like versus hole-like character 
of a band is manifested in the energy variation 
with LL index and magnetic field B. LLs orig- 
inating from electron-like Fermi pockets in- 
crease in energy with LL index and B, whereas 
the opposite trend applies for hole-like LLs. The 
saddle-point energy separates electron-like 
from hole-like pockets. With this in mind, we 
examined the experimental spectra of Fig. 2A. 
At negative displacement field, a set of well- 
resolved V1 LLs is clearly visible in the d//dV 
map at B = 4T (Fig. 2B) and the corresponding 
extracted spectra (Fig. 2A). We identified the 
zeroth LL at the bottom of the V1 band, which 
was at Vg = -45 mV for zero displacement 
field. For a given displacement field in Fig. 2A, 
the LL energy spacing in the V1 band is ob- 
served to gradually decrease with increasing 
bias voltage, followed by a gradual increase, 
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Fig. 2. Tuning LLs in TDBG with displacement and magnetic fields. 

(A) Tunneling spectra versus Vg of TDBG at selected displacement fields 
taken from the map in (B). Tunneling spectral map in the D versus Vg plane 
at (B) B = 4 T. (C) Corresponding theory map for B = 4 T (quantum 
calculation). (D) Tunneling spectral map in the D versus Vg plane at B = 8 T. 


pointing to a transition from an electron-like 
to a hole-like pocket through a saddle point. 
The LL spacing is inversely proportional to 
the LDOS, which peaks at the Van Hove sin- 
gularity at the saddle point. The saddle point 
is pinned at the Fermi level for negative carrier 
densities and visible as the high LDOS cross- 
ing through the V1 band at positive densities 
(Fig. 2, B and D). 

We used the B-field dependence of the LLs 
to confirm the nature of the tunable band struc- 
ture. Figure 3, A to C, displays d//dV maps in 
the B versus Vz planes at a fixed displacement 
field, showing the dispersion of the LLs with 
B in the V1 band. At negative displacement 
fields (Fig. 3, B and C), the LLs below the 
saddle point at the Fermi level disperse with 
positive slope, and those above the saddle 
point disperse with negative slope. This con- 
firms the electron versus hole-pocket nature 
of the V1 band. The transition from electron 
to hole pocket in the V1 band is similar at 
positive displacement fields, where increasing 
displacement fields tunes the V1 saddle point 
to lower energies (Fig. 3A). At D = 0.35 V nm 
(Fig. 3A), the saddle point is tuned to the mid- 
dle of the band with an equal number of posi- 
tive and negative dispersing LLs on either side, 
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= 120 mV for (B) and 
spectral map data. 


which demonstrates the well-determined tun- 
ability of electron and hole pockets with dis- 
placement field. 


Deviations from the Onsager relation 


We now zoom in on the detailed features in 
the V1 band, leveraging LL spectroscopy to 
extract quantitative information about the 
zero-field material properties involving valley- 
contrasting orbital magnetism and orbital mag- 
netic susceptibility. Figure 3D shows extracted, 
well-resolved spectra at selected B fields. In 
Fig. 3E, we show the corresponding peak po- 
sitions of the LLs in the V1 band versus B for 
n = 0 to 7, which shows how the LLs disperse 
with magnetic field. LLs are semiclassically 
described by Onsager’s quantization condition, 
which requires that the total phase accumula- 
tion over a cyclotron orbit be an integer mul- 
tiple of 2 (19). This yields 


S(En)0o/2m = 2nBp(n+1/2) (1) 


relating the k-space area S(E,,) of the zero-field 
iso-energy contour of the nth LL E, to the 
magnetic field B,, where oo = h/e is the mag- 


netic flux quantum, / is Planck’s constant, 


LDOS (arb. units) 
02 O04 06 


(E) Corresponding theory map for B = 8 T (quantum calculation) (42). 
The maps in (B) and (D) were acquired on the maxima positions in the 
moiré structure indicated by the red dot in Fig. 1D. Setpoints: / = 20 pA, 


(D), T = 0.01 K. See figs. S11 to S13 for additional 


and e is the elementary charge. With the equal- 
energy k-space area scaling as B,(n + 1/2) 
for all LLs at energy E,, a plot of S<|B,(m + 
1/2)] 0,19... versus LL energy for all LLs would 
collapse onto a single curve. From Fig. 3E, we 
plotS<[B,(n + 1/2)],-9,.7 in Fig. 3F. The fact 
that experimental data points fail to collapse 
onto a single curve demonstrates the in- 
adequacy of the Onsager relation and the need 
for higher-order corrections. 

Recently, a systematic expansion of the semi- 
classical Onsager relation described in Eq. 1 was 
derived by adding corrections in terms of 
powers of B (21): 


By(n + 1/2)/o = S(En) /4n? + 


mi (Ep)Bn + X'(En)B2/2+.. 


The coefficients of the first- and second-order 
correction, 7'(E) and y'(£), are the derivatives 
with respect to energy of the total orbital mag- 
netic moment and orbital magnetic susceptibil- 
ity, respectively. Eq. 2 is thus a powerful tool to 
either infer the zero-field material properties 
S(E), m'(E), and y'(Z) from a given LL spec- 
trum, or to predict the detailed LL spectrum 
given the knowledge of the zero-field material 
properties. In the next section, we extract these 
properties for TDBG from the LL spectrum 
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Fig. 3. Breakdown of the standard Onsager relation in TDBG. Spectral maps in the B versus Vp 
plane for displacement fields of (A) D = 0.35 V nm“, (B) D = -0.05 V nm‘, and (C) D = -0.1 V nm? 
showing the dispersion of the electron and hole-pocket LLs with magnetic field, and the shift of 

the V1 saddle point to lower energies with increasing displacement field. (D) Tunneling spectra 
versus Vp of TDBG as a function of magnetic field at fixed D = -0.05 V nm“. (E) LL peak positions 
versus B for D = -0.05 V nm“'. Data from LLs n = 0 to n = 8 are shown. Symbols, experimental 

peak positions; solid lines, cubic-spline interpolation of experimental peak positions. (F) S(E) 
computed from the solid lines in (E) with Eq. 1. The lack of data collapse indicates that second-order 


contributions to the LL energies are substantial. 


and show a large contribution of y'(Z) for 
moiré systems that scale with an increase in 
moiré lattice constant. 


Extracting zero-field properties 


The first-order correction m'(E) includes con- 
tributions from the total Berry curvature en- 
closed by an equal energy contour (Fig. 4, A 
and B), which yields the well-known Berry 
phase correction, and the average orbital mag- 
netic moment per carrier at energy E (2], 45). 
The calculated Berry curvature Q for the V1 band 
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is represented by a hue in the mini-Brillouin 
zone for each valley K and K’ in the Brillouin 
zone in Fig. 4, A and B, respectively. The 
valleys have equal and opposite Berry curva- 
ture and orbital moment. By symmetry, the 
total intravalley orbital magnetic moment is 
finite only at nonzero displacement field. This 
tunability of the valley-contrasting orbital mag- 
netism enables the disentangling of the orbital 
susceptibility from the orbital magnetic moment, 
which we use in the analysis of the LLs below. 
The strength of these contributions in Eq. 2 


can be evaluated by using the continuum model. 
Figure 4C shows the terms on the right side of 
Eq. 2 for LLO, calculated with the zero-field 
continuum model and multiplied by o,/B to 
obtain the phase contribution in the extended 
Onsager relation in Eq. 2 (see fig. S5 for all 
phase contributions). We note that these phase 
terms are energy dependent and are large in 
TDBG; for comparison, a value of 0.5 would be 
equivalent to the x Berry phase in single-layer 
graphene. The sign of the orbital moment and 
its derivative m'(E) is opposite for the K and K' 
valleys, as expected from the valley-contrasting 
Berry curvature shown in Fig. 4, A and B. 
Figure 4D shows how the phase contribu- 
tions displayed in Fig. 4C drastically alter the 
semiclassical LL spectrum. The semiclassical, 
zeroth-order LL spectrum (Eq. 1) is displayed 
in blue for the lowest LLs in the V1 band by 
using S(EZ) obtained from the zero-field contin- 
uum model. The first-order correction m'(E) 
leads to a valley splitting of the LLs indicated 
by the dashed red and black lines. The calcu- 
lated valley splitting at the bottom of the V1 
band is substantial but within the LL spacing, 
where the LL index can be assigned unambig- 
uously. This makes LLO ideally suitable to de- 
termine the experimental valley splitting from 
our high-resolution measurements. 
High-resolution d//dV spectra of LLO at 
increasing magnetic fields 9 to 13 T at D = 
-0.25 V/nm are shown in Fig. 4E. We observed 
a clear experimental splitting, with the red lines 
indicating the fitted peak positions. At 13 T, the 
LLO peak is very broad owing to many levels 
forming from the Hofstadter spectrum at high 
magnetic fields. We present a comparison to 
the spectrum computed quantum mechanically 
in Fig. 4F (gray peaks) to show the onset of 
Hofstadter minibands at higher fields. The 
values of splitting obtained with semiclassical 
calculations are similar (Fig. 4D). Using an 
energy broadening comparable to the experi- 
ment, shown as the black curve, we obtained 
qualitative agreement with the experiment. 
Figure 4G shows a direct comparison of the 
experimental LLO splitting in Fig. 4E (blue 
data points) with the quantum mechanical 
calculations in Fig. 4F (open orange circles) as 
a function of B field. We found semiquanti- 
tative agreement between the calculated and 
measured valley-splitting values, demonstrat- 
ing the tunable valley-contrasting orbital mag- 
netism present in TDBG. Most notably, the 
splitting was much larger than the standard 
Zeeman effect with Landé g factor of 2 (solid 
black line) and did not increase linearly with 
B field as one would expect for a Zeeman- 
type splitting mechanism. The latter indicates 
that the splitting is not only given by m’(B), 
but rather, a considerable nonzero second- 
order correction x’ leads to nonlinearity in the 
splitting with B. We obtained an estimate of 
m’ in the middle of the V1 band of = 3.5 x 10” 
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Fig. 4. LL splitting originating from valley-contrasting orbital magnetism function of B for D = -0.25 V nm” that show a splitting of the LLs in high magnetic 


and Berry curvature in TDBG. Calculated three-dimensional plot of the V1 


fields. The red lines indicate peak positions determined from nonlinear least-square 


band with Berry curvature shading at U = -40 meV for (A) K and (B) K’ valleys. K __ fitting of a double Lorentzian function to LLO. (F) Quantum calculation of LLO as a 
and K’ valleys have equal and opposite nonzero Berry curvature and equal and function of magnetic field for an interlayer potential U = -40 meV, corresponding to 


opposite orbital magnetic moment (orbital moments depicted schematically). E, is 
the D-dependent energy of the bottom of the V1 band. (€) A comparison of the 
calculated first- (m') and second-order (y') correction magnitude with the extended 
Onsager relation for the V1 TDBG band (U = -40 meV). See fig. S5 and S6 for 
additional phase contribution results (42). (D) Calculation of the first few LLs (n = 0, 
1, and 2) versus B, comparing quantum mechanical to semiclassical methods with 
zeroth-, first-, and second-order corrections. The first-order correction is grossly 
deficient in matching the quantum results, particularly in the n = O level, whereas 
including both first- and second-order corrections provides a good agreement 

with the quantum results (U = -40 meV). (E) Tunneling spectra of the LLO peak as a 


C/(J s), comparable to a calculated estimate 
of ~ 4.4 x 10” C/(J s), using the values for the 
splitting, S'‘Z) and x'(£), extracted from the ex- 
periment (42). 

The first-order correction m’ does not ex- 
plain the lack of collapse of the LLs onto a 
single curve. This is the result of a strong, 
experimentally observable second-order cor- 
rection, given by the energy-dependent orbital 
magnetic susceptibility y'(Z). This contribu- 
tion originates from several mechanisms, as elu- 
cidated in recent theoretical works (18, 46, 47), 
including the geometrical origin, namely the 
Berry curvature and quantum metric (48). The 
calculated y'(E) of TDBG has the same sign for 
both the K and K’ valley, as shown for LLO in 
Fig. 4C. This results in a shift of the valley-split 
LLs, quadratically increasing with B field, 
as shown by the black and red solid lines in 
the continuum-model calculation in Fig. 4D. 
This fully expanded semiclassical approxi- 
mation with first- and second-order correction 
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(Eq. 2) agrees well with the quantum me- 
chanical spectrum obtained for the same 
system (red and black open circles), again 
emphasizing the importance of the second- 
order correction to adequately describe the 
system. For a large region of energies and mag- 
netic fields, the second-order correction is 
larger than the first one (Fig. 4, C and D, and 
fig. S6). This means that the energy-dependence 
of the magnetic field-induced orbital moment 
exceeds the displacement field-induced orbi- 
tal moment. 

To extract the second-order correction to 
the LL energy, we note that the first-order split- 
ting is typically not visually evident for most 
electron-like LLs in the V1 band at smaller 
displacement fields owing to smaller valley 
splitting from orbital magnetism, and pre- 
sumably, to broadening. The energy value of 
a single LL peak is then approximately the 
average of the valley-split values. The valley- 


averaged energy at a fixed displacement field 


a displacement field of D = -0.25 V nm. The LL shows a valley splitting upon 
application of a displacement field. The solid black lines have a Gaussian energy 
broadening of o = 1 meV to match experiment; gray lines, o = 0.05 meV. (G) The 
energy difference of the split n = O LL from (E) compared with quantum calculations 
in (F). For the quantum calculation in higher fields, we plot the splitting as the 
difference in energy between lowest levels in the Hofstadter bands of the K and K’ 
valleys. The uncertainty in the experimental points is derived from twice the 
standard deviation obtained from the nonlinear least-square fits of the peaks in (E). 
For comparison, the solid lines show the calculated Zeeman splitting with 
Landé g factor of (black) g = 2 and (red) 6.5. 


is obtained from Eq. 2 by letting m'(E) = 0, 
keeping only the second-order correction. For 
equal energy LLs n and m with associated 
fields B, and B,,, Eq. 2 yields the following 
for S and x’: 


(7 + 1/2) — B,(m + 1/2) 


2ne 
sea BB, 


B (Bi, — Br) 
(3) 
; e |B,(n+1/2) — Bm(m + 1/2) 
ae: (BB) 
(4) 


Figure 5A shows the average values of S(F) 
obtained by evaluating Eq. 3 for all pairs of LLs 
in Fig. 3E at each energy value. By contrast 
to Fig. 3F, SCE) collapses onto a single curve 
(blue data points) in Fig. 5A, confirming the 
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Fig. 5. Extraction of enhanced energy-dependent orbital magnetic susceptibility in moiré quantum 
matter from the extended Onsager relation. (A) The symbols represent S(E) computed from the solid 
lines in Fig. 3E with Eq. 3, demonstrating data collapse onto one continuous curve. The solid line 
represents S(E) computed for the V1 band from continuum-model calculations. (B) The symbols represent 
x'(E) computed from the solid lines in Fig. 3E with Eq. 4. The solid line represents y'(E) computed from 
continuum-model calculations. See (42) for a discussion of the error analysis. 


applicability of the second-order correction. 
The result has a similar trend to that of the 
zero-field continuum-model calculation of SCE) 
(orange solid line). 

Similarly, we obtained the average value of 
X'(E) by evaluating Eq. 4 (blue data points in 
Fig. 5B). The experimentally extracted values 
are in qualitative agreement with the zero- 
field orbital magnetic susceptibility (orange 
line), computed with the continuum model 
according to the formalism of (49). The un- 
certainty in the extracted y'(£) and the de- 
viation in overlap of segments from different 
pairs of LLs are attributed to two factors. First, 
corrections to Landau quantization energies 
have a smaller impact on the high-index LLs 
located at higher energies (50). Extracting the 
relatively smaller corrections at these energies 
therefore leads to larger uncertainty. Second, 
variations of the chemical potential with mag- 
netic field shift the LL energy positions, espe- 
cially at high magnetic fields (42). We do not 
believe interactions make a qualitative impact 
on our analysis. The LL scaling obtained with 
the extended Onsager relation is highly con- 
strained, and we find it is well satisfied over 
a wide range of electron filling. Presumably, 
interactions would renormalize LL energies in 
a distinct manner, which would depend strongly 
on filling factor. 


Discussion and outlook 


There are several distinct microscopic mecha- 
nisms that underlie the orbital magnetic suscep- 
tibility, including recently identified geometric 
contributions (78, 46). In fig. S9, we show the 
contributions to y'(E) from these mechanisms 
[see (42) for the full breakdown]. The contri- 
butions from the Pauli paramagnetism and 
the quantum geometry involve the orbital mag- 
netic moment, Berry curvature, and quantum 
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metric. These mechanisms make the domi- 
nant contribution to y'(Z) near the top of V1 
and bottom of C1 bands, where the Berry cur- 
vature is peaked (Fig. 4, A and B). The con- 
tributions from the van Vleck paramagnetism 
and k-space energy polarization are dominant 
in the middle of the V1 band, whereas the well- 
known Landau-Peierls susceptibility is sub- 
stantial at the band edges. 

We attribute the large value of y'(Z) to the 
large lattice constant a of the moiré potential. 
This can be roughly understood with dimen- 
sional analysis: for a single band model, the 
total orbital magnetic susceptibility y general- 
ly scales as Wa’, where W is the bandwidth, 
so that the derivative of y with respect to 
energy then scales as a”, which can be large for 
moiré quantum systems. For a more in-depth 
analysis, see fig. S10, A and B, which shows 
that the maximum value of y’ increases dra- 
matically with increasing moiré wavelength, 
indicating that future measurements of orbi- 
tal magnetism should be even more observable 
at larger wavelengths. 

MQM holds great opportunities for efficient 
magnetization control, topological transport 
phenomena, the quantum anomalous Hall 
effect, and technological applications in mag- 
netoelectric, magneto-optics, and topological 
spintronics. Understanding the underlying band 
topology and magnetic response functions 
are the key metrological capability required 
to practically harness all these opportunities. 
Our measurements and analysis point a way 
forward to this goal. 
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An atomic-scale multi-qubit platform 


Yu Wang?+, Yi Chen?34+, Hong T. Bui”>+, Christoph Wolf’, Masahiro Haze”*, 
Cristina Mier’, Jinkyung Kim*®, Deung-Jang Choi"”*°, Christopher P. Lutz’, 
Yujeong Bae">*, Soo-hyon Phark?2*, Andreas J. Heinrich®* 


Individual electron spins in solids are promising candidates for quantum science and technology, where 
bottom-up assembly of a quantum device with atomically precise couplings has long been envisioned. 
Here, we realized atom-by-atom construction, coherent operations, and readout of coupled electron-spin 
qubits using a scanning tunneling microscope. To enable the coherent control of “remote” qubits that 
are outside of the tunnel junction, we complemented each electron spin with a local magnetic field 
gradient from a nearby single-atom magnet. Readout was achieved by using a sensor qubit in the tunnel 
junction and implementing pulsed double electron spin resonance. Fast single-, two-, and three-qubit 
operations were thereby demonstrated in an all-electrical fashion. Our angstrom-scale qubit platform 
may enable quantum functionalities using electron spin arrays built atom by atom on a surface. 


onstructing and coherently controlling 
nanoscale qubit systems lies at the very 
heart of quantum-coherent nanoscience 
(1). This length scale requires the use of 
fundamental quantum properties of atoms, 
such as the spin of electrons, which naturally 
occurs in many solid-state environments and 
allows high-fidelity operations and readout by 
electromagnetic means (2). Despite decades of 
effort, however, it remains a formidable task to 
realize an atomic-scale quantum architecture 
in which multiple electron spin qubits can 


. Sensor 
~ Qubit 


MgO 
Ag 


be precisely assembled, controllably coupled, 
and coherently operated. Electron spin qubits 
created in dopants in semiconductors and color 
centers in insulators, for example, can be well 
controlled individually (3, 4) but are difficult 
to couple together into a circuit. An attractive 
alternative approach is to use atomic-level fabri- 
cation with a scanning tunneling microscope 
(STM), in which atom manipulation (5) or se- 
lective desorption (6) can lead to designed 
quantum spin architectures (7, 8). As a first step 
toward in situ operation of atomic quantum 


: Sensor 


Remote Qubit 
Qubit 


RF E-field 


m 


devices, a radiofrequency (RF) voltage Shes 
used to coherently control a single elect. 
spin in the STM tunnel junction (9, 70) in a 
so-called electron spin resonance (ESR)-STM 
setup (11-14). However, achieving the potential 
of this platform requires multiple addressable 
and detectable qubits that lie outside of the 
subnanometer tunnel junction region. 


Building a multi-qubit platform atom by atom 


Figure 1A illustrates our strategy to construct 
such an atomic-scale multi-qubit platform. A 
“sensor” qubit consisting of a spin-1/2 hydro- 
genated Ti atom on a bilayer MgO film (J4, 15) 
was positioned in the tunnel junction, driven, 
and detected by a magnetic tip (16, 17). Outside 
of the tunnel junction, addressable “remote” 
qubits were created by positioning a spin-1/2 
Ti atom (74, 15) ~0.6 nm away from a single 
Fe atom (/8) using atom manipulation [see 
section 1 of (79)]. An Fe atom can be seen as a 
single-atom magnet in this work because its 
spin relaxation time far exceeds the time scale 
of qubit operations and its resonance frequen- 
cies are far off the applied RF range (11, 18, 20) 
[see section 2 of (19)]. The main purpose of Fe 
atoms here is to substitute for the tip in pro- 
viding a static local magnetic field for remote 
Ti spins (those not directly under the tip apex) 
to allow their electrical driving. Under a tip- 
supplied RF electric field, Ti spins are believed 
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Fig. 1. Bottom-up construction of multiple coupled electron spin qubits. 
(A) Schematic: A sensor spin qubit (Ti, blue) is placed under the apex of a 
spin-polarized STM tip for readout. Remote qubits are constructed at 

precise separations to the sensor qubit by atom manipulation. Each remote 
qubit is composed of a spin-1/2 Ti atom (red) and a single-atom magnet (Fe) 
(green), where Fe’s magnetic field gradient, in combination with the RF 

electric field between the tip and the sample, coherently drives remote qubits. 
(B and C) Constant-current STM images showing atom-by-atom construction 
of a multi-qubit structure composed of two (B) and three (C) qubits (image 
size: 5.0 x 5.0 nm). Inset: atomic registry of the structure in (C). Structure in (B) 
has the same configuration but without remote qubit 2. (D and E) Continuous-wave 
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ESR spectra measured with the tip positioned on the sensor qubit in the 
two-qubit (D) and three-qubit (E) structure. The quantum states of the sensor 
qubit are labeled by the first kets. Each ESR transition of the sensor qubit 
distinguishes a quantum state of the remote qubits (the second kets). 

The measured spin-polarized tunnel-current signal A/ is the difference between 
averaged signals in lock-in A and B subcycles (fig. S10), which reflects the 
change of the sensor spin polarization due to applied coherent control pulses 
[see section 4 of (19)]. Imaging conditions in (B) and (C): sample bias voltage 
Voc = 100 mV, time-averaged tunnel current Ipc = 10 pA. ESR conditions 

in (D) and (E): Voc = 50 mV, Ipc = 20 pA, zero-to-peak RF voltage Var = 30 mV. 
The sample was kept at 0.4 K during measurements. 
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to undergo relative oscillations inside Fe’s mag- 
netic field gradient, thus experiencing an effec- 
tive resonant driving magnetic field (27, 22), in 
analogy to micromagnet driving in quantum 
dots (23) [more details of the driving mecha- 
nism can be found in section 2 of (79) and in 
(24)]. By contrast, ESR of the sensor qubit, as 
well as previous ESR-STM spectroscopy (12-14), 
relies on the tip's magnetic field gradient 
(21, 22). Initialization of the qubits was per- 
formed thermally by cooling the sample to 
0.4 K and applying an external magnetic field, 
producing a thermal spin state having a pre- 
dominant population in the spin ground state 
(~90% under a 0.7 T field). 

A representative structure composed of two 
remote qubits and one sensor qubit was con- 
structed atom by atom using atom manipu- 
lation [see section 1 of (19)], as shown in Fig. 1, 
B and C. The qubit-qubit couplings, including 
dipolar and exchange spin-spin couplings, are 
sensitive to their atomic separations down to 
the angstrom scale (14, 15) (figs. S1 and S2), 
consistent with results in other material sys- 
tems (25). The STM-based atomically precise 
construction scheme thus allowed us to engi- 
neer the resonant frequencies and couplings 
among all of the spins, an essential step for 
addressing and detecting multiple qubits indi- 
vidually [see section 3 of (19) and fig. S2]. 

Detection of the remote qubits was achieved 
through ESR spectroscopy of the sensor qubit, 
the ESR transition frequency of which depends 
on the quantum states of other qubits (Fig. 1, D 
and E). The ESR frequencies of the qubits were 
designed to be sufficiently separated compared 
with the qubit-qubit couplings so that the 
multi-qubit states could be well described by 
Zeeman product states, for which we used 
the first (second) brackets to denote the quan- 
tum states of the sensor (remote) qubit. For 
example, in a structure composed of a sensor 
qubit and a remote qubit (Fig. 1D), the two ESR 
transitions of the sensor qubit at frequencies f, 
(corresponding to transition |0)|0) < |1)|0)) 
and f5 (|0)|1) <> |1)|1)) detect the populations 
of |0) and |1) states of the remote qubit, 
respectively. The difference between f, and 
fo is determined by the Ti-Ti coupling [see 
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section 3 of (79)]. Multiple remote qubits 
can be simultaneously sensed in a similar 
fashion (Fig. 1E). 


Single-qubit operation and readout of a 
remote qubit 


To demonstrate this qubit control and readout 
scheme, we began by measuring a remote 
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qubit’s ESR spectrum with the tip positioned 
above the sensor qubit, as sketched in Fig. 2A. 
The energy levels and ESR transitions of this 
two-qubit structure are illustrated in Fig. 2B, 
and its STM image is shown in Fig. 1B. To 
individually address the sensor and remote 
qubits, we used two RF sources to apply two 
consecutive RF voltage pulses to the STM tip. 
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Fig. 2. Coherent control of a single remote qubit. (A) Schematic of the measurement scheme. Two RF 


tones, RF-S and RF-R, were applied 


to the STM tip for the coherent control of the sensor and remote qubit, 


respectively. The atomic structure used in this figure is shown in Fig. 1B. (B) Energy diagram and ESR 


transitions of the two-qubit system. 


(C) Typical pulse sequence composed of a control pulse on the remote 


qubit (red) followed by a sensing pulse on the sensor qubit (blue). The control pulse can induce conditional 
or unconditional qubit rotations depending on its frequency content. (D) ESR spectra of the remote qubit 
measured with the tip positioned on the sensor qubit. The two curves correspond to different sensing 


frequencies (top, fs = f; bottom, fs 


= fo). Insets: pulse sequence and ESR transitions involved in each 
spectrum. (E) Rabi oscillations of the remote qubit performed by simultaneously driving fp 


= f3 and f4 to 


induce unconditional qubit rotations. The sensing pulse was subsequently applied at fs = f,. Red curve is a 
fit to an exponentially decaying sinusoid. A second y axis on the right shows the population of state |0), which 


is determined by temperature and transition frequency at tg = 0 (here 90%) (3). (F) Two-axis con 
the remote qubit shown by sweeping the relative phase » between two consecutive 2/2 pulses (fr = 


rol of 
fz and fa; 


fs = f,). The data have been vertically shifted by 300 fA to allow easier comparison to (E). @ = 180 degrees 
does not correspond to Al = 0 here because of the RF rectification effect [see section 4 of (19) and fig. S10]. 
Solid curve shows a cosine fit. The axes of the two-step qubit rotation are illustrated by red arrows on 

the Bloch sphere in the rotating frame. ESR conditions in (D): Voc = 50 mV, Ipc = 20 pA, Vs = Va = 30 mV, 
te = ts = 200 ns; and in (E) and (F): Voc = 50 mV, Ipc = 20 pA, Vs = 50 mV, Ve = 120 mV, ts = 200 ns. 
The sample was kept at 0.4 K during measurements. 


2 of 6 


RESEARCH | RESEARCH ARTICLE 


&. e 
» os os 


B 
40 12.5 
WN 30 16.67 
= ot 
= 20 25 Z 
e = 
A 10 50 S 
ie] 
99 30 60 90. 120 
Vp (mV) 
30 16.67 
N v a 
=I 20 256 
= vy Pe 
=) 
o OR O92 
be 
aa 
10 20 30 
Ipc (pA) 


With CNOT 


Fig. 3. Two-qubit operations in the coupled-qubit structure shown in Fig. 1B. 
(A) Controlled rotation of the remote qubit, conditional on the sensor qubit 
state being |0), obtained by driving the transition |0)|0) <> |0)|1) at fp = fs. Red 
points show coherent oscillations of the remote qubit detected through a sensing 
pulse at fs = f;, as shown schematically on the left. Red curve is a fit to an 
exponentially decaying sinusoid. The CNOT operation time is ~13 ns (dotted line). 
(B) Rate of controlled rotations, Q/2z, of the remote qubit. The rate increased 
linearly with the RF voltage Vp (upper) but remained unchanged as the tunnel 
current Ipc (lower) was varied. Error bars are comparable with the size of 
symbols. Corresponding CNOT times are plotted on a second y axis on the right. 
Solid curves are linear fits. Inset illustrates the change of tip-sensor separations. 


(C and D) Controlled rotations of the sensor qubit without (C) and with (D) a 
CNOT operation at fp = fz on the remote qubit. Blue (cyan) points measured 
with fs = f; (f2) show sensor Rabi oscillations (solid lines are exponentially 
decaying sinusoidal fits) only when the remote qubit has a significant population 
in state |0) (|1)). The CNOT operation transferred the predominant population 
from state |0)|0) to state |0)|1), thus causing the opposite trends of the 
oscillations between (C) and (D). Blue curves are shifted vertically by 50 fA 

for clarity. ESR conditions in (A): Vpc = 50 mV, Ip¢ = 10 pA, Vs = 30mV, Vp = 
120 mV, ts = 200 ns; in (C): Vpg = 20 mV, Ip¢ = 7.5 pA, Vs = 100 mV; and in (D): 
Voc = 20 mV, Ipc = 7.5 pA, Vs = Vp = 100 mV. The sample was kept at 0.4 K 
during measurements. 


A control pulse was applied at frequency fR 
to control the remote qubit, followed by a sens- 
ing pulse at frequency fg acting on the sensor 
qubit (Fig. 2, B and C). A sensing pulse applied 
at fg = ff (2) results in an ESR signal that 
depends on the population of state |0) (|1)) of 
the remote qubit. To obtain the ESR spectrum 
of the remote qubit, we swept the frequency 
FR of the control pulse across the remote qubit 
resonances while keeping the sensing pulse 
fixed at a resonance frequency of the sensor 
qubit. When fg matched an ESR transition 
of the remote qubit (e.g., |0)|0) <> |0)|1)), the 
joint two-qubit state populations were altered, 
resulting in a detectable increase (decrease) 
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of the sensor’s ESR signal at f, (/2) [see section 4 
of (79) and Fig. 2D]. This measurement is 
conceptually similar to ensemble double- 
electron spin resonance spectroscopy and 
allowed us to directly obtain the resonance 
frequencies of the remote qubit (/3 and /,; 
Fig. 2D) even though no tunnel current passed 
through it. 

Single-qubit control can be performed in our 
platform by coherently driving a qubit inde- 
pendent of other qubits’ quantum states. This 
was achieved by exciting all ESR transitions of 
a remote qubit with the same driving strength 
(26), as illustrated in the insets of Fig. 2, E and 
F. We measured Rabi oscillations of a remote 


qubit by varying the pulse duration tp of the 
control pulse and subsequently sensing at f, 
(Fig. 2E). 

Two-axis control (27) of the remote qubit 
can be demonstrated by varying the relative 
phase 6 of two consecutive 1/2 pulses (Fig. 2F). 
On the Bloch sphere in the rotating frame, the 
first 1/2 pulse rotates the qubit state from the 
g axis to the y axis, and the second performs a 
1/2 rotation around an axis in the v-y plane 
at an angle of to the a axis (Fig. 2F, bottom 
inset). The resulting signal was well described 
by the expected cos(o) dependence (27). Two- 
axis control allows arbitrary single-qubit oper- 
ations (28). 
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Fig. 4. Three-qubit operations in a multi-qubit 
atomic structure. (A) Schematic of the control 
scheme of a multi-qubit structure (Fig. 1C) 
composed of two remote qubits and a sensor qubit. 
(B) Energy diagram and ESR transitions used for the 
measurement in (C). (€) Controlled-controlled 
operation of remote qubit 1 performed by driving the 
transition |0)|00) <> |0)|10) [(A), red pulse; (B), 
red arrow]. Four different frequencies of the sensor 
qubit were used for sensing the four remote qubit 
states [(A), blue pulse; (B), blue arrows, where 

the detected states of the remote qubits are labeled 
by red kets in (C)]. The two oscillating curves in 
(C) (orange and magenta) correspond to detection 
of the remote qubits’ |00) or |10) states upon the 
controlled-controlled operation at |0)|00) <> |0)| 
10). The CCNOT operation time is ~20 ns (dotted line). 
The other two qubit states were not driven 

so they showed no oscillations. Green, orange, 

and blue curves are vertically shifted for clarity. 

(D) Strategy to perform single-, two-, and three- 
qubit operations in our platform. Single-qubit 
operations are performed by nonselectively exciting 
all ESR transitions of a qubit with the same strength. 
Two-qubit controlled operations were performed by 
selectively exciting ESR transitions that correspond to a 
specific quantum state of one control qubit. Three- 
qubit controlled-controlled operations are performed 
by selectively exciting ESR transitions that correspond 
to a specific quantum state of multiple control qubits. 
ESR conditions in (C): Voc = 50 mV, Ipc = 20 pA, 

Vs = 50 mV, Vp = 80 mV, ts = 200 ns. The sample 
was kept at 0.4 K during measurements. 


Fig. 5. Relaxation and coherence A 
properties of remote qubits. 
The two-qubit structure shown 
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Single-qubit operation 


in Fig. 1B was used in these 
measurements. (A) Relaxation time 
T, of the remote qubit measured 
by an inversion recovery scheme. 
The exponential fit (solid line) 
yields a relaxation time T, of 166 + 
4 ns for the remote qubit. Inset: 
Labels of ESR transitions used 

in this figure (same as Fig. 2B). 
(B) Ramsey measurements of the 0 
emote qubit with fs = f; and fp = 


100 


fz + 30 MHz yielding a coherence 0 
time To* = 86 + 13 ns. Inset: T>* 
shows a dependence on the tunnel 


current Ipc. (€) Spin-echo measurements of the remote qubit measured with fs = f and fp = fz. The exponential fit (Solid line) yields a coherence time of T> 


Delay time t (ns) 


400 


Inset: T2§°"° shows no dependence on Ipc. ESR conditions in (A): Voc = 50 mV, Ipc = 20 pA, Vs = 50 mV, Vp = 
Inc = 10 pA, Vs = 60 mV, Vp = 120 mV, ts = 200 ns. The sample was kept at 0.4 K during measurements. 


Two-qubit controlled operation 

Controlled NOT (CNOT) operations are straight- 
forward to perform in our platform by selectively 
exciting ESR transitions that correspond to a 
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specific quantum state of a control qubit. In 
the case of a two-qubit structure, we could per- 
form a CNOT operation of the remote qubit 
by driving a single ESR transition such as |0)| 


6 October 2023 


Free evolution time t (ns) 


1 a ae 
0 400 800 1200 
Free evolution time t (ns) 


Echo _ 


= 300 + 54 ns. 


120 mV, ts = 200 ns; and in (B) and (C): Voc = 50 mV, 


0) < |0)/1) at fa =f5 (Fig. 3A), which changes 
the state of the remote qubit if and only if the 
sensor qubit is in state |0). Here, the quantum 
state of the remote qubit was subsequently 
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measured by applying a sensing pulse at f{ 
(Fig. 3A). Sensing at fs (instead of at,f,) showed 
oscillations of the opposite sign because this 
effectively detected the population of state 
|1) (instead of state |0)) of the remote qubit 
(fig. S4) (26). The oscillations in Fig. 3A show a 
CNOT operation time of ~13 ns for the remote 
qubit. The gate operation rate increased in 
proportion to the RF amplitude (Fig. 3B, top, 
and fig. S3A), with an upper limit set by RF 
heating effects. The fast operation of remote 
qubits is believed to result from the strong 
magnetic field gradient of the neighboring Fe 
atom (24). As a consequence of Fe-dominated 
driving, the CNOT rate is independent of how 
far the tip is to the remote qubits, as reflected 
by the tunnel current pc (Fig. 3B, bottom, and 
fig. S3B) (24). 

To evaluate the effect of the CNOT operation 
on the remote qubit, we performed controlled 
rotations of the sensor qubit without and with 
this CNOT operation (Fig. 3, C and D, respectively). 
Because the initial thermal population of the 
two-qubit system was predominantly in state 
|0)|0), acontrolled rotation of the sensor qubit 
at f, starting from the initial state |0)|0) (ie., 
without a CNOT operation) showed clear oscil- 
lations because a pulse at f, excites the tran- 
sition |0) |0) < |1)|0). By contrast, measurement 
at transition /, showed no detectable oscillations 
(Fig. 3C) because remote qubit state |1) was 
nearly unoccupied. These coherent oscillations 
were reversed after a CNOT operation of the 
remote qubit at f3 (Fig. 3D), which transferred 
the predominant population from state |0)|0) 
to state |0)|1), hence becoming accessible to a 
sensing pulse at fg = fo (i-e., |O) |1) <> |1)|1)) but 
not f,. We further verified that a CNOT oper- 
ation of the remote qubit at /, (instead of f3) did 
not strongly affect the oscillations observed on 
the sensor qubit (fig. S6), highlighting the se- 
lective, controlled nature of CNOT operations. 


Multi-qubit controlled operation 


Although single- and two-qubit operations are 
sufficient to generate arbitrary quantum cir- 
cuits, multi-qubit operations can significantly 
reduce the execution time and mitigate accu- 
mulated operation errors (29). As long as the 
qubit-qubit couplings can be spectroscopically 
resolved, our platform allows fast, native qubit 
operations having multiple control qubits (by 
selectively exciting certain ESR transitions, as 
shown in Fig. 4D). To demonstrate this ability, 
we constructed a three-qubit structure composed 
of two remote qubits (referred to as RQI and 
RQ2; see fig. S2 for details) and a sensor qubit 
(as sketched in Fig. 4A and imaged in Fig. 1C). 
We selectively drove RQI’s |0)|00) < |0)|10) 
transition (Fig. 4B, red arrow), corresponding to 
a rotation of RQ] if and only if the sensor qubit 
and RQ2 were both in state |0) (Fig. 4A). When 
varying the pulse duration tp, this controlled- 
controlled operation caused oscillating popu- 
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lations between states |00) and |10) of the two 
remote qubits, whereas populations of states 
|O1) and |11) were left unchanged (Fig. 4C). Here, 
the changes of populations were determined 
from the intensities of the four ESR transitions 
of the sensor qubit (Fig. 4A, blue pulse, and 
Fig. 4B, blue arrows). Similarly, the controlled- 
controlled operation |0)|01) <> |0)|11) caused 
oscillating populations between states |01) 
and |11) of the two remote qubits, albeit with 
reduced amplitudes because of the reduced 
initial thermal populations in these states (fig. 
S7E). From these measurements, we obtained 
a controlled-controlled NOT (CCNOT) opera- 
tion time as short as 20 ns. Single-, two-, and 
three-qubit gates in our platform have com- 
parably fast operation times because they only 
differ in the frequency content of the RF pulses 
(30) (Fig. 4D). 


Characterizing the coherence of remote qubits 


In STM-based approaches, tunneling electrons 
have posed severe limitations on the energy 
relaxation time T, and coherence time T, of 
the spins (18, 31). This limitation is overcome 
in our scheme because tunneling electrons pass 
only through the sensor qubit and do not flow 
through the remote qubits (Fig. 1A). To charac- 
terize the remote qubits, we implemented an 
inversion recovery measurement by first apply- 
ing az pulse on the remote qubit to invert its 
population. After a delay time 1, we applied a 
sensing pulse to the sensor qubit to measure 
the state of the remote qubit and obtained 
an energy relaxation time 7, = 166 + 14 ns (J4). 
A pronounced improvement can be seen in 
the quantum coherence of remote qubits, which 
is already visible in the higher quality of the 
remote qubit’s Rabi oscillations (Fig. 3A) com- 
pared with the sensor qubit (Fig. 3, C and D). 
Figure 5B shows the Ramsey signal measured 
at a tunnel current of [pc = 10 pA, from which 
we extracted a coherence time 7>* = 86 + 13 ns. 
We found that 7,*, the coherence time subject 
to inhomogeneous broadening, depended on 
the tunnel current and thus on the tip height 
(Fig. 5B, inset, and fig. S8A), suggesting that 
the proximity of the STM tip to the remote 
qubits influenced their quantum coherence time 
despite the absence of tunnel current through 
them. This decoherence effect was likely caused 
by slow thermal or field fluctuations, the lat- 
ter possibly arising from slight, uncontrolled 
tip motions (37). To cancel the effect of this 
inhomogeneous broadening, we performed a 
spin-echo measurement and observed that the 
measured Ym showed negligible dependence 
on the tunnel current (Fig. 5C, inset, and fig. 
S8B). The measured coherence time T>="° 
300 + 54 ns (Fig. 5C) approaches the theo- 
retical limit of 27;. These measurements high- 
light that the quantum coherence of remote 
qubits after spin-echo filtering is limited by 
energy relaxation events, in contrast to other 


solid-state qubits (3, 27), where T, >» To®*P° 
usually applies. 


Discussion and outlook 


Our work introduces an atom-by-atom con- 
structed qubit platform using electron spins 
on surfaces where cryogenic initialization, 
universal multi-qubit operations, and multi- 
qubit detection have been demonstrated. Two 
types of future developments will improve the 
performance of this qubit platform. The first 
seeks to obtain more gate operations within 
the spin decoherence time. Our fast gate oper- 
ation of ~20 ns implies that a significant im- 
provement can be attained with a moderate 
increase of 75, e.g., through the use of thicker 
insulators (18), a detection mechanism that 
avoids magnetic tips (32), the depletion of sur- 
rounding nuclear spins (~5% of atoms in MgO 
have nuclear spins), and the use of quantum gates 
that are protected by dynamical decoupling- 
schemes (33). For reference, electron spins in 
bulk insulators (34) and semiconductors (35) 
have been demonstrated with coherence times 
of ~1 ms. 

The second development seeks to increase 
the number of controllable qubits. Given that the 
RF electric field from the tip can have a radiation 
range of tens of nanometers (corresponding 
to the effective tip diameter) (36), scaling up 
the number of remote qubits should not be 
limited by driving but rather by detection. A 
direct extension of the work shown here would 
lead to up to four remote qubits being placed 
around a sensor qubit for direct read-out. Fur- 
ther increase will rely on remote qubits at next- 
nearest neighbor sites, where a SWAP gate (28) 
could be used to transfer the quantum infor- 
mation (37). Finally, the computational Hilbert 
space can be further expanded by using mo- 
lecular qubits to construct three-dimensional 
architectures (38) or by using qudit systems 
with multiple electron and nuclear spin states 
(39, 40). Such extensions rely on the specific 
benefit of our surface-based approach, in which 
a myriad of available spin species (17, 41-43) 
and geometries (44, 45) can be precisely assem- 
bled using scanning probe techniques. 
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PHOTOSYNTHESIS 


A chlorophyll c synthase widely co-opted 


by phytoplankton 


Yanyou Jiang’, Tianjun Cao"+, Yuging Yang’, Huan Zhang", Jingyu Zhang?, Xiaobo Li 


1,25 


Marine and terrestrial photosynthesis exhibit a schism in the accessory chlorophyll (Chl) that 
complements the function of Chi a: Chl b for green plants versus Chl c for most eukaryotic 
phytoplankton. The enzymes that mediate Chl c biosynthesis have long remained elusive. In this work, 
we identified the CHLC dioxygenase (Phatr3_J43737) from the marine diatom Phaeodactylum 
tricornutum as the Chl c synthase. The chic mutants lacked Chl c, instead accumulating its precursors, 
and exhibited growth defects. In vitro, recombinant CHLC protein converted these precursors 

into Chl c, thereby confirming its identity. Phylogenetic evidence demonstrates conserved use of 
CHLC across phyla but also the existence of distinct Chl c synthases in different algal groups. 

Our study addresses a long-outstanding question with implications for both contemporary and ancient 


marine photosynthesis. 


hotosynthesis has shaped and continues 

to shape our landscape, oceans, and at- 
mosphere. Photosynthetic creatures from 
terrestrial and aquatic environments differ 

in their light-harvesting systems. In land 
plants and certain cyanobacteria, chlorophyll 
(Chl) 6 is used to complement the absorption 
spectrum of the commonly harbored Chl a; by 
contrast, many eukaryotic algal groups, despite 
being morphologically and phylogenetically di- 
verse, have consistently adopted Chl ¢ as the 
accessory chlorophyll (7). The existence of Chl ¢ 
as the third chlorophyll species was reported as 
early as 1864: (2). However, its biosynthetic mech- 
anism was poorly understood, in contrast to 
that for cyanobacterial Chl f, whose synthase 
was identified shortly after its discovery (3, 4). 
Phylogenetically, Chl c-containing algae to- 
gether with some of their nonphotosynthetic 
relatives are termed the CASH lineages (where 
CASH stands for cryptophytes, alveolates, 
stramenopiles, and haptophytes) (5). Among 
them, cryptophytes notably harbor four genome 
compartments; many dinoflagellates (alveolates) 
cause toxic red tides or serve as endosymbionts 
in corals; diatoms (stramenopiles) contribute 20% 
of global primary productivity; brown algae (also 
stramenopiles) make up kelp forests as habitats 
of marine creatures; and coccolithophores (hap- 
tophytes) precipitate carbon dioxide from the air 
into rocks (6, 7) (fig. SIA). Thus, Chl c is of great 
relevance in contemporary marine photosynthe- 
sis and global biogeochemistry. However, our 
poor understanding of Chl c biosynthesis and, 
as a result, the absence of mutants with a primary 
defect in Chl c accumulation have hindered 
our direct characterization of the functions of 
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Chl c. Furthermore, synthetic biology efforts to 
transfer the blue-green light-capturing, Chl c- 
containing antennae (6) to other phototrophs 
is not possible without knowing the identity of 
the Chl c synthase. 

Elucidation of Chl c biosynthesis is also 
highly relevant to our understanding of the 
evolution of aquatic phototrophs. All Chl 
c-containing algae harbor plastids of red algal 
(eukaryotic) origins; yet, whether they acquired 
the plastid from a single red alga engulfment 
event or serially has been debated (8) (fig. S1, B 
and C). The identification of the enzymes in- 
volved in the biosynthesis of Chl c—a pigment 
shared by these algae but lacking in red algae— 
will shed light on the evolutionary history of 
these red plastid-bearing algae (9). Recent ad- 
vancements in the diatom genetic toolbox offer 
opportunities for studying Chl c biosynthesis. 


A predicted oxidoreductase, CHLC, is required 
for Chl cz and Chl c, accumulation in a diatom 


We aimed to identify the Chl c synthase from 
the genetically tractable marine diatom Phaeo- 
dactylum tricornutum, which harbors Chl c, 
and Chl c, (10). We first applied CRISPR-Cas9 
technologies that were recently developed for 
this alga (JJ) to obtain mutants for candidate 
genes. On the basis of structural features, re- 
searchers have hypothesized that Chl c. could 
be derived from the common chlorophyll pre- 
cursor divinyl protochlorophyllide a@ (DVP) 
(12) through a single desaturation reaction, 
whereas Chl c, could be generated through the 
same conversion on monovinyl protochloro- 
phyllide a (MVP) or through reduction of the 
C8 vinyl group in Chl c, by divinyl reductases 
(1, 13) (Fig. 1). Thus, we screened for genes 
containing oxidoreductase domains whose 
mutation would abolish the accumulation of 
the two Chl c species and prioritized those co- 
expressed with known chlorophyll biosynthe- 
sis genes or light-harvesting complex (LHC) 
genes (74) (data S1 and materials and methods). 


0 


For 20 genes, we successfully obtained | Che: 
upc 
tant transformants, but none of them bo-— 
visible color phenotype (data S1). By contrast, 
for the gene annotated to encode a phytanoyl- 
CoA hydroxylase (PHYH; Phatr3_J43737; re- 
named CHLLC, as explained below) (Fig. 2A), 
many transformants appeared greenish (Fig. 2B), 
which contrasts with the brown color of wild-type 
(WT) P. tricornutum cells. PHYH (EC 1.14.11.18) 
was originally discovered in mammals (/5), and 
it is a 2-oxoglutarate (20G)-dependent dioxy- 
genase that catalyzes hydroxylation of a car- 
bon atom. However, enzymes of this family 
can catalyze a variety of reactions. For exam- 
ple, a bacterial 20G-dependent dioxygenase, 
the carbapenem synthase CarC (EC 1.14.20.3), 
catalyzes the formation of a C=C double bond 
at the a- and B-positions of the carboxyl group 
(16), similar to the scenario of Chl c generation 
from DVP or MVP (Fig. 1). We thus focused on 
characterizing the potential function of CHLC 
in Chl c biosynthesis. 

To ascertain that the green coloration re- 
sulted from the disruption of CHLC, we genotyped 
both green- and brown-colored transformants. 
We found that all green-colored mutants har- 
bored frameshift-causing mutations or large 
deletions in the CHIC gene (fig. S2; primers 
in data $2), and we subcloned three randomly 
chosen colonies to yield the pure, stably 
maintained chlc-1, chic-2, and chic-3 mutant 
lines (Fig. 2, D, E, and G). By contrast, all the 
brown transformants showed the WT se- 
quence, indicating that their respective CHLC 
gene sequences were not disrupted (fig. $2). 
Complementation of a representative mutant, 
chic-1, with a copy of CHLC on an episome 
that encodes a WT CHLC protein fused with 
the green fluorescent protein (GFP) at its C 
terminus restored the WT appearance (Fig. 
2, C to G), which demonstrates that CHLC 
is required for the normal coloration of 
P. tricornutum cells. 

To identify the specific metabolites affected 
by CHLC mutations, we performed pigment 
analyses using high-performance liquid chro- 
matography (HPLC) (data S3). We observed that 
chic mutants exhibited abolished Chl c. and c 
accumulation and a lowered fucoxanthin-to- 
Chl a ratio (Fig. 2H). These results may reflect 
a combination of primary and secondary effects 
of CHLC loss, in which the primary outcome of 
CHLC loss is on one of the light-harvesting an- 
tenna pigments (Chl ¢ or fucoxanthin) and the 
secondary impact is on the accumulation of the 
other one, as we have recently reported (17). 

Our HPLC pigment analyses also revealed 
an accumulation of peaks that we assigned as 
DVP and MVP on the basis of their absorbance 
profiles (78) (Fig. 2H). Because of the similar 
absorbance spectra of Chl ¢., Chl c¢, DVP, and 
MVP as well as the isomeric relationship be- 
tween Chl c, and DVP, we further analyzed their 
fragmentation patterns in liquid chromatography- 
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tandem mass spectrometry (LC-MS/MS) (fig. 
83). The two pigments assigned as Chl cz and 
Chl c, were purified together from WT cells 
and analyzed as a single sample. They both 
exhibited the [M+H-18]* dehydration fragment 
in their secondary mass spectra; this fragmen- 
tation was only reported in chlorophylls with 
an acrylic side chain at C17 (9). By contrast, the 
pigments assigned as DVP and MVP were puri- 
fied from the chlc-1 mutant, and neither of 
them gave rise to [M+H-18]* fragment peaks 
under the same conditions (fig. S3). These 
results confirm our identification of the four 
compounds. 

The accumulation of DVP and MVP but not 
fucoxanthin precursors is opposite to our ob- 
servations in fucoxanthin biosynthesis mu- 
tants (17). Thus, these results suggest that 
CHLC is directly involved in Chl c—but not 
fucoxanthin—biosynthesis. 


Recombinant CHLC produces Chl cz and c; 
from precursors accumulated in chic mutants 


To directly test the function of CHLC in Chl c 
biosynthesis, we investigated its activity in vitro. 
We produced recombinant CHLC proteins (la- 
beled PtCHLCs) and incubated them with the 
DVP-MVP mixture isolated from the chlc-7 mu- 
tant. Chl c. and c, were readily produced (Fig. 
3A). The conversion depended on the presence 
of 20G and ferrous iron and was hampered 
when ascorbate was omitted. These require- 
ments are consistent with other 20G-dependent 
dioxygenases (1/6), which suggests that the 
CHLC reaction involves hydrogen atom trans- 
fer followed by electron transfer, as observed 
for the 20G-dependent desaturase CarC (20, 21). 
Next, we performed an additional round of 
purification to obtain DVP free of MVP to be 
used as the substrate and found that CHLC 
was able to convert DVP into Chl cy (Fig. 3B); 
this cleaner assay further confirms the assign- 
ment of CHLC desaturase activity illustrated 
in Fig. 1. Therefore, on the basis of the above 
results, we propose to rename this protein Chl 
c synthase (CHLC). 

We next explored the mechanistic basis of 
certain structural features of Chl c (22). Unlike 
Chls a, 6, d, and f, which contain a chlorin 
skeleton (with a reduced D ring), the two 
major Chl ¢ species, Chl c. and Chl ¢;, retain 
the porphyrin skeleton (with a nonreduced 
D ring; see Fig. 1) from the precursors DVP 
and MVP. In canonical chlorophyll biosynthe- 
sis pathways, the conversion of the porphyrin 
skeleton into a chlorin skeleton is catalyzed 
by protochlorophyllide oxidoreductases (PORs) 
(23) (Fig. 1). We hypothesized that the porphyrin 
skeleton retention in Chl c, and c; may depend 
on the substrate specificities of CHLC and 
PORs—i.e., the conversion of DVP and MVP 
into Chl ¢, and c, by CHLC may preclude fur- 
ther conversion by PORs, and conversely, the 
conversion of DVP into divinyl chlorophyl- 
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Fig. 1. An overview of the key steps of Chi c biosynthesis in P. tricornutum. Details of the steps from 
glutamate to DVP and the steps from monovinyl chlorophyllide a (MVC) to Chl a are skipped for simplicity. 
Carbon atoms 8, 171, and 17° and the five rings are labeled on DVP. Enzymes (shown as ovals) include 


CHLC (this study), divinyl reductase (DVR), and POR. 


Intermediates and cofactors include 20G, DVC, 


DVP, MVC, and MVP. Shaded blocks indicate structural features generated by the corresponding enzymes 


(brown, CHLC; blue, POR; green, DVR). The circles in 
chlorophylls. NADP*, nicotinamide adenine dinucleotid 


lide a (DVC) may prevent the action of CHLC 
(Fig. 3C). 

We prepared DVC as a substrate for CHLC 
using in vitro reactions catalyzed by a charac- 
terized plant POR protein (Arabidopsis thaliana 
PORB) (24). We successfully obtained DVC 
based on the product’s absorbance spectrum, 
including a prominent, red-shifted Qy band at 
667.3 nm (fig. S4; see materials and methods 
for purification details). If CHLC could act on 
DVC, we expected that the addition of CHLC 
would generate a product with a strong Qy 
band as well but with an earlier elution profile 
compared with that of DVC based on the differ- 
ences in elution between DVP (MVP) and Chl ¢5 
(Chl c,) on our HPLC systems (Fig. 2H and Fig. 
3A). We did not find such a product. All the 
contaminant peaks associated with our DVC 
preparation or generated by CHLC had low 
absorbance in the red region, characteristic of 
compounds containing a porphyrin skeleton 
(Fig. 3, D and E). These results suggested that 
DVC could not act as a substrate for CHLC. 

Previously, land plant PORs have been found 
not to act on Chl ¢, and Chl ¢, (25). Assuming 
that PORs in P. tricornutum and plants have 
similar substrate specificities, the data suggest 
that CHLC and PORs both act on DVP ina 
mutually exclusive manner, and this can ex- 
plain how the porphyrin skeleton is retained 
in Chl ¢ (Fig. 3C). 


CHLC is chloroplast localized and is required 
for normal physiology 


We took advantage of the chic mutants to 
understand the function of the CHLC protein 


magenta highlight the protonation site absent in other 
e phosphate; NADPH, reduced form of NADP*. 


and Chl c. Proteomic comparison of chlc-1 with 
WT cells revealed multiple processes affected 
in the mutant, reflected as altered abundance 
of many proteins, although with a modest 
false discovery rate (FDR). The changes in- 
clude lower relative abundance of some of 
the chlorophyll-binding proteins in the Lhcf 
family, lower abundance of a flavodoxin, and 
higher abundance of some of the reaction center 
subunits of the two photosystems (data S4). 
Notably, a few proteins annotated as heat shock 
protein Hsp20 isoforms were also up-regulated 
in the mutant. Previously, hyperaccumulation 
of Hsp20 family members has been observed 
in dark-treated P. tricornutum cells (26). Thus, 
our results are in line with Chl ¢ playing an 
important role in photosynthesis. 

The chilc-1 mutant showed a decreased con- 
tent of Chl a on a per-cell basis compared with 
the wild type, but the decrease of fucoxanthin 
was more prominent (fig. S5A), which explains 
the greenish appearance of the mutant (Fig. 
2D). Consistently, the whole-cell absorbance 
spectra showed chic mutants to be deficient in 
relative absorbance around 450 to 580 nm, 
around the absorbance peaks of Chl c and 
fucoxanthin (6) (fig. S5B). This absorption de- 
fect led to an actual effect on photosynthetic 
energy harvesting based on chlorophyll fluores- 
cence measurements at 77 K—the chlc-1 mutant 
had lower amplitudes compared with the wild 
type, with excitation wavelengths from 450 to 
550 nm (fig. S5C). 

Consistent with a function of Chl c and fuco- 
xanthin in light harvesting, expression of 


CHIC in the wild type decreased after being 
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transferred from low-light to high-light con- 
ditions (fig. S5D); as controls, the chlorophyll 
biosynthesis gene CHLM and an LHC gene 
LHCRI0 were down-regulated and up-regulated 
by high light, respectively, consistent with pre- 
vious studies (27). chic mutants showed growth 
defects under low light but not as promi- 
nently during the first several days under 
high light. The mutants did grow slower than 
the wild type in the later stage under high 
light, possibly as a result of the reduced light 
availability to cells caused by a higher cell 
density (fig. $5, E and F). 

We used the DiatOmicBase (https://www. 
diatomicsbase.bio.ens.psl.eu/) page of CHLC 
to learn its behavior in previous genome-wide 


Cc 


CHLC-F gesgRNA (131-150) 100 bp 
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li Dioxygenase domain 


VECT-F 
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studies. CHLC was predicted as a chloroplast 
protein by ASAFind (28). We confirmed that 
the GFP signal from the CHLC-GFP fusion 
protein coincided with the chlorophyll auto- 
fluorescence in COMP! cells (fig. S6). Besides 
high light, previous studies have shown that 
nitrogen limitation, a commonly occurring stress 
for phytoplankton, also lowers the expression 
of CHLC (29). Notably, H3K9/14:Ac (active chro- 
matin mark) and H3K4me2 histone modifica- 
tions have been shown to occur at the CHLC 
locus (30). Under nitrogen limitation, both 
marks exhibited lower signals and upstream- 
shifted peaks (fig. $7). These results suggest 
epigenetic mechanisms to be involved in the 
regulation CHLC expression. 


Episome in COMP1 
Chee 


Phylogenetic survey of CHLC within 
ochrophytes suggests the presence of at least 
one distinct, yet-unknown Chi c synthase 
We investigated the distribution of CHLC among 
representative ochrophyte algae (photosynthetic 
stramenopiles; fig. S1, Fig. 4A, and data S5). 
We noted that diatoms (Bacillariophyta), Boli- 
dophyceae, Pelagophyceae, and Dictyophyceae 
all have both Chl c (6, 78) and CHIC orthologs 
[E value cutoff 1E~?° (1 x 10~?°)]. These algae 
are closely related and belong to the so-called 
Diatomista group, one of the two groups within 
ochrophytes (37). 

By contrast, brown algae (Phaeophyceae), 
Raphidophyceae, and Synchromophyceae (not 
shown in Fig. 4A) that belong to the Chrysista 


CHLC-F / CHLC-R 
750 bpi——— =] 2kb 


300 bp: 


VECT-F / GFP-R 


— 


23 4 5 


123 4 5 1 


VECT-F / GFP-R 


B D F 
> > ‘ —_________—— ——__—__—— 
ee foe i —— I ] awww Wwwlawnrad 
e 4 “s ac s e ae <6. . e CGTCMAAGACTGTGGAAGCAGAGCAAGTCC 
lz oe * . Wild type chic-1 —chic-2_—schic-3.. += COMP 1 125 COMP1 (5) 154 
G CHLC-F / CHLC-R H 
S ] Fx 
2) 
2 , Chl a 
CGTCCAATACGGTAGAAGCCGAAGAAGTCC e } Chie gig Ddx Wild type 
T T 1 
125 Wild type (1) 154 itd / 
300 800 300 800 : 
Wavelength [nm] 
CGTCCAATA-GGTAGAAGCCGAACAAGTCG E] Chie, Chic, 7 
125 chic-1 (2) (1 bp deletion) 14 g \ \ Madyee e 
6 3 
G DvP : 4 
3 iy chic-1 ® 
CGTCCAATACG — ~~ — AAGCCGAACAAGTCC ie ee |, Cee 8 
125 chie-2(3) (4 bp deletion) 154 ra coMP1 = 
a4 
25 30 35 
Retention time [min] 
CACCAGT — ACTATCATTCTGCTAT 
118 124 539 6 DvP MVP 
chic-3 (4) (414 bp deletion) a hack peed 
e 629.2 629.2 
CGTCIAAGACIIGTEGAAGCAGAGCAAGTCC 300 800 300 800 i z 46 We 


125 154 


COMP1 (5) 
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with an intact CHLC-GFP fusion gene. (G) Genotyping results of the chic mutants 
and COMPI1. Sequencing results from the PCR products in (E) are shown. 
Nucleotide positions in the coding sequence of CHLC are labeled. For COMPI, 


and CHLC-R and also the position of the single-guide RNA (sgRNA) used in 
CRISPR-Cas9 mutagenesis. (B) Part of an agar plate containing colonies from a 
transformation designed to disrupt CHLC. Green and brown arrows point to 
example transformants that we rated as green and brown clones. (€) A 
schematic representation of the additional, episomal copy of the CHLC gene 
present in the complemented COMP] line. (D) A photograph of liquid cultures of 
the wild type, three mutants disrupted in CHLC, and the COMP1 line. (E) Agarose gel 
electrophoresis of polymerase chain reaction (PCR) products yielded from wild 
type, chic mutants, and the COMPI line using the two primer pairs shown in (A) 
and (C). Strains are numbered consistently across panels (E) to (G). bp, base 
pair. (F) Part of the sequencing results from the PCR product yielded by COMP1 
using the primer pair CHLC-F—-GFP-R; the size and sequence are consistent 
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the sequence signals were mixed because both the mutated copy and the episomal 
copy on the episome were amplifiable using the CHLC-F—CHLC-R primer pair. In 
(F) and (G), synonymous substitutions in the episomal copy of CHLC are shown with 
a red shade. These were designed to avoid further editing of this episomal copy 
(see materials and methods). (H) HPLC analyses of pigment extracts of wild type, 
chic mutants, and the COMP1 line. The traces were normalized against the height 
of the Chl a peak in each sample. The parts containing Chl cz and Chl c, peaks 
in wild type and COMP1 as well as that containing DVP and MVP peaks in 
chlc-1 are magnified and shown on the left for clarity. The absorption spectra 
of these four peaks are presented on the left. Car, carotene; Ddx, diadinoxanthin; 
Fx, fucoxanthin; Rel. Abs., relative absorbance. For (D) and (H), representative 
results from no less than three replicates are shown. 
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group contain Chl c but not CHIC orthologs 
(6, 18, 32). In a recent study, we have reported 
that some of these Chrysista algae accumulate 
fucoxanthin as the major xanthophyll used for 
light harvesting, like diatoms do, but use a 
biosynthetic path different from that used by 
diatoms (17). Our findings in this work suggest 
a coevolution between the chlorophyll and 
carotenoid components of the light-harvesting 
antennae. It is possible that Diatomista and 
Chrysista algae each evolved Chl c and fuco- 
xanthin biosynthesis separately. Alternatively, 
Chl c and fucoxanthin may have been present 
in the common ancestor, and one of the two 
lineages lost their biosynthetic pathways and 
then evolved different pathways. These hy- 
potheses remain to be tested. 

Among other Chrysista groups, Eustigmato- 
phyceae do not synthesize Chl c and do not 
harbor any gene highly similar to PtCHLC, 
whereas Chrysophytes include species with or 
without Chl ¢ (6, 18, 33); no CHLC ortholog was 
found in their genomes or transcriptomes. For 
Pinguiophyceae, a consensus has not been 
reached on whether they should be grouped 
into Diatomista or Chrysista (31, 34). They 
contain both Chl c (18) and CHLC (data S5). 

We caution that some of the sequence data- 
sets used in our surveys are transcriptomes, 
and certain genes may have not been detected 
because of their low expression levels under 
the given conditions (data S5). Nevertheless, 
CHIC is consistently absent in the genomes 
and transcriptomes of Chrysista species ex- 
amined, which increases our confidence in the 
presence of at least one additional synthase 
of Chl c. 


CHLC is widely distributed within 
cryptophytes, haptophytes, and 
dinoflagellates and is the predominant Chl c 
synthase among known species 


We then examined the occurrence of CHLC 
beyond ochrophytes. When PtCHLC was searched 
by BLAST against Tara Oceans Single-Cell and 
Metagenome Assembled Genomes sequences 
for marine eukaryotes (EUK-SMAGs) and 
the unassembled Marine Atlas of Tara Oceans 
Unigenes (MATOU) metatranscriptomics data- 
base (35-37), prevalent signals were observed 
across the sampling stations (fig. S8, A and C). 
The EUK-SMAGs hits were from various groups, 
especially haptophytes and ochrophytes (fig. SSB). 

To obtain evolutionary insights into the CHLC 
family, we combined two approaches to search 
for its relatives. We (i) surveyed major groups 
of photosynthetic organisms with existing ge- 
nomes or transcriptomes and (ii) performed an 
unbiased BLAST search in the UniProt database 
that includes heterotrophic organisms [E val- 
ue cutoff IE (1 x 10°°)]. CHLC orthologs 
with an E value smaller than 1E~”° (<1 x 10-”°) 
were found in cryptophytes, haptophytes, and 
many dinoflagellates in addition to Diatomista 
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Fig. 4. Occurrence of CHLC and related proteins across the red and green 
lineages of photosynthetic eukaryotes. (A) The chlorophyll species and 
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Chl co, Chl cy, Chl c3, Chl Ces-779, and their esters with lipophilic molecules (18). 
Pigment compositions are displayed according to recent studies or reviews 


ochrophytes but were not found in glaucophytes, 
green algae, land plants, and chromerids. Two 
cyanobacterial genes, several genes from het- 
erotrophic bacteria, and a number of genes 
from photosynthetic eukaryotes were deteced 
with a larger E value or lower identity com- 
pared with PtCHLC (Fig. 4A and data S5). 
CHIC orthologs were also absent in red algae 
as well as the plastid-less relatives of Chl c- 
containing algae, such as Goniomonadaceae, 
Centroplasthelida, Ciliophora, Oomycota, and 
Actinophryidae. This occurrence pattern 
corresponds well with the ability to synthe- 
size Chl c among these groups (6, 18) (fig. SIA 
and Fig. 4A). 

We experimentally tested the enzymatic ac- 
tivity of three representative CHLC orthologs— 
from the cryptophyte Guillardia theta (Gt), the 
haptophyte Phaeocystis globosa (Pg), and the 
dinoflagellate Amphidinium carterae (Ac). All 
three proteins converted DVP and MVP into 
Chl c, and Chl ¢, in vitro (Fig. 3F), demonstrat- 
ing a conserved function of the CHLC family 
among the four CASH lineages. 

To visualize the phylogenetic relationship of 
CHLC orthologs within the four CASH line- 
ages, we constructed a phylogenetic tree (Fig. 
4B) with these proteins using blast hits from 
cyanobacteria, mosses, and heterotrophic bac- 
teria as an outgroup; all these organisms lack 
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Chl ¢ and are well known to be distant from 
the CASH lineages. Among the high-identity 
(>40%) hits of CHLC (Fig. 4B and data S5), 
there appear two main clades: One was com- 
posed of dinoflagellates with a fucoxanthin- 
less plastid, and the other was composed of 
ochrophytes (Diatomista), haptophytes, and 
cryptophytes. The latter also contained dino- 
flagellates with a fucoxanthin plastid. Kare- 
niaceae species are a group of dinoflagellates 
harboring fucoxanthin plastids that originated 
from engulfed haptophytes to replace the an- 
cestor peridinin plastid (38) (arrow 1 in Fig. 
4A). Their CHLC genes are also related to the 
haptophyte ones (Fig. 4B), which suggests a 
late plastid origin. Haptophytes themselves 
were hypothesized to contain a plastid (and its 
genome) from an engulfed cryptophyte (39) 
(arrow 2 in Fig. 4A). In addition, multiple hap- 
tophyte proteins predicted with a chloroplast 
localization seem to be of an ochrophyte ori- 
gin, and one existing explanation is that an 
ochrophyte related to pelagophytes and dic- 
tyochophytes was once engulfed as an earlier 
plastid of haptophytes (40-42) (arrow 3 in Fig. 
4A). If the latter is true, CHLC orthologs in 
haptophyte nuclear genomes may be of this 
origin, considering the close clustering be- 
tween haptophytes and this group of ochro- 
phytes in the tree. Many MATOU sequences 
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(6, 18). The three arrows indicate three endosymbiotic events that were 
mentioned in the text. (B) A maximum-likelihood phylogenetic tree of the CHLC 
family. The green-filled stars indicate PtCHLCs and functionally validated CHLC 
orthologs, whereas the gray-filled stars indicate proteins unable to generate 

Chl c based on in vitro assays (Fig. 3F and fig. S10). Please see the text for a 
discussion of outgroup selection. Symbio-LID refers to Symbiodiniaceae low- 
identity blast hits. Protein names are edited to include letters from the genus or 
species names. For the original IDs, see data S5. 


with unknown species information cluster with 
known ochrophyte and haptophyte species 
(fig. S8D), which suggests that the relationship 
between ochrophyte and haptophyte CHLC 
orthologs can be better resolved when more 
species can be cultured for sequencing or are 
sequenced at the single-cell level. 

A group of dinoflagellates (Symbiodinium 
sp. CCMP421, Breviolum minutum, Cladocopium 
goreaui, and Fugacium kawagutii) of the family 
Symbiodiniaceae (order Suessiales) had two or 
three blast hits with an E value smaller than 
1E °° (<1 x 107°), but for each species, one of 
the hits had a low identity (below 30%; data 
85). These Symbiodiniaceae low-identity pro- 
teins (referred to as Symbio-LID hereafter) form 
a branch separate from the high-identity CHLC 
orthologs (Fig. 4B). For Symbiodinium sp. 
CCMP421, both the high-identity hit CAMPEP_ 
0181423024 (labeled Symsp_0181423024 in Fig. 
4B) and the low-identity CAMPEP_0181492860 
were functionally tested in vitro. Only the former 
(referred to as SSCHLC hereafter) generated 
Chl c (fig. S10). This may be related to the lack 
of large stretches of amino acids (correspond- 
ing to PtCHLC positions 255 to 279 and 321 
to 336) and changes of residues around the 
“HXD/E” motif in the active site (43) in Symbio- 
LID compared with active CHLC members 
(fig. S11). 
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To probe into the emergence of the activity 
of CHLC in evolution, we constructed an un- 
rooted phylogenetic tree using blast hits sat- 
isfying the E value cutoff of 1E~? (1 x 10°) from 
groups beyond CASH lineages (data S5, fig. S9, 
and materials and methods). Notably, the 
CHLC orthologs with a high identity (arger 
than 40%) in the four CASH lineages still form 
a monophyletic group. Low-identity hits from 
bacteria, Hexanauplia, Chlorarachniophyceae, 
Bryopsida, and algae of the CASH lineages are 
intermixed, obscuring the origin of the CHLC 
family. For those proteins not closely clustered 
with PtCHLC, we tested the activity of hits from 
a cyanobacterium, a proteobacterium, and a 
moss as well as an algal hit (P. tricornutum 
Phatr3_EG01494). They failed to produce Chl ¢ 
in vitro (fig. S10), although it is possible that 
they function as dioxygenases on different 
compounds. Our sequence alignment revealed 
multiple residues, especially those around the 
HXD/E motif, to be conserved among and spe- 
cific for the active CHLC members but different 
in bacterial hits (fig. S12). Some of these resi- 
dues may have played a role in the emergence 
of the Chl c synthase activity—e.g., by facilitat- 
ing the alteration of substrate specificity. 


Discussion 


Our discovery of the broadly conserved CHLC 
enzyme resolves a major outstanding question 
in photosynthesis. We demonstrate that CHLC 
is a 20G-dependent dioxygenase that synthe- 
sizes Chl c from common chlorophyll biosyn- 
thesis intermediates. Both Chl a and Chl ¢ are 
derivatives of the DVP precursor, and Chl a 
requires more steps to be synthesized (Fig. 1). 
Yet, the adoption of a dioxygenase for the 
production of Chl c suggests its evolutionary 
emergence to be after that of Chl a because 
oxygenic phototrophs almost universally use 
Chl a in the photosystem reaction centers to 
enable water oxidation (44). However, we can 
not exclude the possibility that the uniden- 
tified Chrysista Chl c synthase(s) are oxygen 
independent, considering that many known 
steps in tetrapyrrole biosynthesis can be cat- 
alyzed by evolutionarily unrelated oxygen- 
dependent or oxygen-independent enzymes 
(12). To identify the Chrysista Chl c synthase 
(s), one can again select candidate genes pre- 
dicted to encode oxidoreductases by using 
transcriptomic datasets or comparative ge- 
nomic approaches. Whereas genetic approaches 
are less advanced for Chrysista species com- 
pared with P. tricornutum, the chic mutants 
from our study would allow a facile, heterol- 
ogous expression-based candidate screen based 
on reversion of their green color. 

CHLCis conserved among Diatomista ochro- 
phytes, cryptophyte, haptophytes, and dino- 
flagellates but is absent in red algal plastid and 
nuclear genomes (data S5). It is only distantly 
related to any gene in organisms without Chl c, 
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whether photosynthetic or not. How the Chl c 
synthase activity first came into existence thus 
remains a mystery. Regarding the inheritance 
of CHIC across CASH lineages, the potential 
early and late plastid origins of haptophyte 
and Kareniaceae CHLC genes provide support 
for the hypothesis that plastids serve as “shop- 
ping bags” for the evolution of the host ge- 
nome (45). The observation that cryptophytes 
branch within ochrophytes in the tree of the 
CHLC family (Fig. 4B) is intriguing, consider- 
ing that cryptophytes were hypothesized to 
be the earliest lineage among CASH algae in 
serial endosymbiosis models (39, 41, 42) (fig. 
S1C). This branching topology may be a phylo- 
genetic artifact but could also have resulted 
from a gene transfer from an ochrophyte to an 
ancestor of extant cryptophytes. 

Our chic mutants allow for the investigation 
of the consequences of the loss of Chl c. Chl ¢ 
likely emerged as a replacement to phycobi- 
lisomes for light harvesting with a lower nitro- 
gen, sulfur, and photon cost (46). Because red 
algae already contained chlorophyll-binding 
LHC proteins (9), only modest changes were 
required to accommodate this pigment. In 
addition, Chl c may have other roles related to 
its structural uniqueness. Most Chl ¢ species 
are not esterified to a hydrophobic phytol tail, 
leaving a free carboxyl end that can be re- 
versibly protonated, which has recently been 
hypothesized to permit its versatile functions 
in both harvesting of light and protection from 
excess irradiation in a condition-dependent 
manner (47). Furthermore, the lack of the 
phytol group allows more fucoxanthin to 
bind to the LHC proteins (48). Moreover, 
Chl c was proposed to facilitate energy transfer 
from blue-green light-harvesting carotenoids 
to Chl a in the light-harvesting apparatuses 
(J), and it has now been shown by structural 
approaches that Chl ¢ can interact with both 
fucoxanthin and Chl a in P. tricornutum (48). 
Our results of 77 K chlorophyll fluorescence 
spectra and growth curves under limiting light 
support the role of Chl ¢ in light harvesting, 
and the defects can be partially attributed to 
the secondary effect on fucoxanthin or even 
Chl a accumulation (fig. S5). Regarding the 
lowered relative fucoxanthin content in chlc 
mutants, whether it resulted from biosynthetic 
coordination of the two pigments or whether 
lack of Chl ¢ destabilizes the fucoxanthin- 
chlorophyll protein complexes (48) remains to 
be investigated. 

It should be noted that algal groups in the 
four CASH lineages are different from each 
other in the carotenoid component in the Chl 
c-containing light-harvesting antennae; for 
instance, cryptophytes and many dinoflagel- 
lates use alloxanthin and peridinin instead of 
fucoxanthin (6). It would not be surprising if 
there were distinct aspects in the structural 
and functional roles of Chl c between different 


lineages. The identificaion of CHLC will allow 
researchers to directly target its orthologs in 
cryptophyte, haptophyte, or dinoflagellate spe- 
cies to obtain mutants devoid of Chl c, without 
aneed to screen candidates, once genetic tools 
for these algae become available. Results from 
these efforts will shed light on whether Chl c 
exerts conserved functions across lineages 
beyond its light-harvesting role. 

Our discovery of a Chl c synthase will facil- 
itate synthetic biology efforts to synthesize 
this pigment in other phototrophs for im- 
provements of their light-harvesting capa- 
bility. Algae of the Nannochloropsis genus 
(Eustigmatophyceae) offer an attractive chas- 
sis for such attempts because they are indus- 
trially useful and genetically tractable but lack 
accessory chlorophylls (49). Additionally, their 
ochrophyte identity may facilitate correct tar- 
geting of the CHLC enzyme. 
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The oscillating Fischer-Tropsch reaction 


Rui Zhang’, Yong Wang**, Pierre Gaspard,? Norbert Kruse">** 


The mechanistic steps that underlie the formation of higher hydrocarbons in catalytic carbon monoxide 
(CO) hydrogenation at atmospheric pressure over cobalt-based catalysts (Fischer-Tropsch synthesis) 
have remained poorly understood. We reveal nonisothermal rate-and-selectivity oscillations that are 
self-sustained over extended periods of time (>24 hours) for a cobalt/cerium oxide catalyst with 

an atomic ratio of cobalt to cerium of 2:1 (Co2Ce,) at 220°C and equal partial pressures of the reactants. 
A microkinetic mechanism was used to generate rate-and-selectivity oscillations through forced 
temperature oscillations. Experimental and theoretical oscillations were in good agreement over an 
extended range of reactant pressure ratios. Additionally, phase portraits for hydrocarbon production 
were constructed that support the thermokinetic origin of our rate-and-selectivity oscillations. 


he hydrogenation of carbon monoxide to 
chain-lengthened hydrocarbons was 
initially reported by BASF company in 
1913, where the reaction occurred under 
hydrogen-deficient synthesis conditions 
(7). A decade later, Fischer and Tropsch (2) 
demonstrated hydrocarbon synthesis from 
stoichiometric mixtures with H./CO ratios of 
2 (or even more H,-rich mixtures) that were 
compatible with the “syngas” derived from coal 
gasification and adjusted by water gas shift. 
Mainly iron and cobalt metal particles on a 
suitable support material were found to be 
active catalysts. The underlying mechanistic 
steps of this polymerization-type surface reac- 
tion are still under debate more than 100 years 


Fig. 1. Oscillatory behavior in the FT reaction. 

(A to D) Self-sustained long-term oscillations of the 
temperature over a Co/Ce-oxide catalyst of Co2Ce; 
atomic composition at 220°C, 1 bar total pressure, 
and a H2/CO ratio of 1/1. Oscillating temperatures 
(A) with periods of about 340 s and amplitudes 

of 7 Celsius degrees (for 16 hours) and 2 Celsius 
degrees (6 hours) (B) are observed along with 

a “dissipation” period in between and a short final 
dying-out. The harmonicity of the temperature 
oscillations is demonstrated in (C), and phases of 
temperature are related to those of the reactants and 
selected products after Fourier transformation 

in (D). Black and red dashed lines represent relative 
phases zero and x. C4 products show the same 
behavior. 
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later. No consensus has emerged on the chain- 
lengthening mechanism that leads to hydro- 
carbons and their functionalized derivatives, 
in part because of metal-support synergies that 
range from weak interactions to strong chem- 
ical bonding and compound formation. Also, 
structural and chemical reconstruction effects 
on both the surface and in the bulk of the 
catalytically active metals (3-12) further com- 
plicate mechanistic studies. 

Recent experimental studies have dem- 
onstrated that an all-embracing reaction 
mechanism to produce paraffins, olefins, and 
oxygenates (mainly alcohols and aldehydes) is 
unlikely to exist. Besides metal-support inter- 
actions, process temperatures and variable 
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ted pressure conditions affect the mechan... 
pathways. In this context, we note that Fischer 
originally adopted the view of a “formate-type” 
intermediate (2) after the report by Christiansen 
(13) of the catalytic hydrogenation of methyl 
formate to methanol. However, Fischer soon 
discarded this possibility after realizing that 
excessive carbon deposition from CO over 
alkali-promoted iron catalysts would lead to 
the formation of metal carbides prone to under- 
going hydrogenation to hydrocarbons. Much 
of the present-day research favors a “carbide- 
type” mechanism that involves C-C coupling 
of CH, intermediates. However, experimental 
evidence has also been presented for a CO inser- 
tion mechanism under steady-state conditions 
of the Fischer-Tropsch (FT) process (14-20). This 
mechanism benefits from its ability to account 
for the formation of chain-lengthened oxygen- 
ates besides paraffins and olefins. Reducible 
oxides of metals, in particular, those contain- 
ing manganese, have been demonstrated to 
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strongly promote the catalytic performance of 
cobalt-based catalysts (9-12, 21, 23-25). Rate- 
and-selectivity hysteresis driven by a Co-to-CogC 
reaction-induced structural transformation has 
been reported for the high-pressure synthesis 
of chain-lengthened paraffins, olefins, alcohols, 
and aldehydes (25). Recent reports on using 
CeO, in Co-based catalyst formulations (26) 
highlighted the occurrence of a strong metal- 
support interaction (27-29) that led to the for- 
mation of patchy capping overlayers with 
enriched metal/CeO, interface structures that 
serve as potential reaction sites (30). These 
insights challenged the authoritative view of 
Iglesia et al. (31) that hydrocarbon synthesis 
over Co-based catalysts is support independent. 

Although time-dependent reaction kinetics 
to the point of hysteresis and oscillations 
were already demonstrated in the early 1970s 
for the CO oxidation over Pt-based catalysts 
(32, 33), little is known so far about such 
kinetic instabilities in reaction networks as 
complicated as the catalytic CO hydrogenation. 
Because of possible heat- and mass-transfer 
limitations in solid catalysts, oscillations are 
usually nonisothermal. Indeed, Tsotsis et al. 
(34) observed temperature variations of up 
to 200 Celsius degrees in the CO hydrogen- 
ation over Fe/zeolite catalysts of poor thermal 
conductivity. In this work, we report periodic 
rate-and-selectivity oscillations in the FT re- 
action over a Co/Ce-oxide powder catalyst. Tem- 
perature amplitudes of several Celsius degrees 
were observed and were correlated to the pe- 
riodic activity-and-selectivity changes of the 
Co/Ce-oxide catalyst. The experimental evidence 
allowed us to develop a mechanistic framework 
that shows how observed oscillatory reactions 
can be generated through periodic temper- 
ature forcing. We also present phase portraits 
for oscillatory reactant and product formation 
in agreement with the thermokinetic origin of 
our observations. 


Experimental studies 


We used a fixed-bed flow reactor designed to 
minimize concentration gradients of reactants 
and products to allow the reactor response to 
be monitored as a function of time when 
switching reactant flows between different 
compositions. In the present study, such a 
transient response was obtained after swiftly 
replacing a Hy flow (for catalyst activation and 
to provide a metallic Co phase) in He with a 
reactive syngas mixture of H,/CO in argon. 
Noble gases were used as balance and reference 
along with an independent flow of Ne that 
bypassed the reactor to allow molecular flows 
of reactants and products to be calculated, 
assuming negligible concentration gradients 
in the reactor and laminar flow in the pipes 
leading to the analytical devices. The experi- 
mental procedure that we adopted to generate 
rate-and-selectivity oscillations of the atmo- 
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Fig. 2. Effect of H2/CO ratio and temperature on the oscillatory behavior over a Co/Ce-oxide catalyst 
with Co2Ce, atomic composition. (A) Evolution of temperature and rate-and-selectivity oscillations (as reactor 
outlet flows of reactants and products) as a function of the H2/CO ratio at a nominal temperature of 220°C. The 
horizontal black dashed line represents the CO molecular flow into the reactor and serves to calculate time- 
averaged reaction rates. Dashed vertical lines distinguish reactant ratios between H2/CO = 1 and 12. oi(t), reactor 
outlet flow. (B) CO conversion and product selectivities for varying H2/CO ratios at 220°C. (C) Time evolution 
of oscillations at 200°C (top) and 260°C (bottom) for selected H2/CO ratios. Red bars in the top graph reflect 
the variation of the oscillation period with time and the H2/CO ratio. 


spheric-pressure FT reaction over Co/ceria is 
shown in fig. S1. 

After reducing conditions in Hz switched to 
reactive conditions in H,/CO, while the flow 
rate was kept constant, the catalytically active 
surface phase began to form. Such chemical 
conditioning always preceded the onset of 
oscillations and was associated with delay 
times of some hundreds of seconds, depend- 
ing on the actual H./CO ratio (which varied 
from 1 to 12) and Co/Ce atomic ratios in Co/Ce- 
oxide catalysts (Co/Ce atomic ratios varied 
from 0.5 to 4). For the conditions shown in 
Fig. 1, long-term oscillations in temperature 
with periods of about 340 s and amplitudes of 
~7 Celsius degrees were obtained for Co/Ce = 2 
and H,/CO = 1. After 15 hours, the amplitudes 
decreased to ~2 Celsius degrees (Fig. 1, A and B, 
and fig. S3). Such changes in amplitude were 
not abrupt but rather sluggish and occurred on 
timescales of ~1 hour. The origin of amplitude 
damping is not clear but could be associated 
with changes in the surface composition of the 
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Fig. 3. Phase portrait of CH, selectivity versus 
temperature at different H2/CO ratios. The experi- 
mental conditions are the same as in Fig. 2. Phase 
portraits, or phase-space trajectories, are constructed 
from extended time series of the oscillating product 
flows. Transitions to higher cycles are caused 
whenever the H2/CO ratio is increased. The red arrows 
indicate the direction of time evolution. 
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Fig. 4. Reduced reaction 
scheme that underlies 
theoretical modeling 
with forced temperature 
oscillations. Chain 
lengthening involves CO 
insertion into O-H or O-R 
bonds of surface hydroxy 
or alkoxy, respectively 
(critical bonds are high- 
lighted in red). Methane 
formation occurs through 
either an “early” pathway 
via CHy hydrogenation or a 
“late” pathway involving 
hydrogenation of surface 
formate. Only the late 
pathway develops oscilla- 
tory behavior, as do higher 
hydrocarbons. Generally, 
the reaction scheme 
lumps together several 
elementary steps to 
enable the mathematical 
treatment. Water gas shift 
is an integral part of the 
scheme. Possible surface 
reconstructions (due to 
Co-to-Co2C or CeQ0>-to- 
Ce203 chemical transfor- 
mations) have not been 
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considered, in accordance with x-ray diffraction results shown in fig. S7. The details of this reaction scheme 


along with the kinetic approach can be found in the s 


catalyst from the adsorption of contaminating 
species. 

Oscillating temperatures reflected periodic 
changes in the exothermicity of the reaction. 
In Fig. 1C, we show a selected time window to 
exemplify the harmonicity of the high-amplitude 
oscillations. Thermal coupling between remote 
catalyst particles had to be highly efficient to 
ensure the apparently concerted reaction behav- 
ior across the catalyst bed. In this context, no 
oscillatory behavior was observed after replac- 
ing Ar with He as the vector gas. The fast heat 
exchange between catalyst particles and gas 
phase that is enabled by the high thermal 
conductivity of He was lost after replacement 
with Ar (for details, see the supplementary 
materials). 

Phase differences between temperature oscil- 
lations and reactants and products (Fig. 1D) 
showed that temperature maxima and minima 
correlated with enhanced and reduced prod- 
uct formation, respectively, whereas reactants 
were out of phase with both temperature and 
products. These interdependencies were also 
apparent from phase portraits, as will be shown 
for the case of methane formation. 

Figure 2 demonstrates the oscillatory behav- 
ior as a function of the H,/CO pressure ratio 
and temperature. Although this pressure ratio 
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upplementary materials. 


has been set to between 1 and 3 in most inves- 
tigations of the FT reaction, we have chosen to 
extend it up to 12 after our previous studies 
over Co/Mn-oxide catalysts, which showed rate 
and selectivity hysteresis to endure a large 
excess of Hy (25). Figure 2A demonstrates 
the exceedingly broad existence range of tem- 
perature and selected chem-species’ oscilla- 
tions as the H,/CO ratio varies from 1 to 12. 
As this ratio is increased from low to high, 
the oscillation period decreased from ~340 to 
~230 s and the amplitude decreased from 7 to 
1 Celsius degree, respectively (fig. S3). Time- 
averaged reactor outlet flows decreased for CO 
and increased for all product species (exempli- 
fied for methane and propene in Fig. 2A), as 
expected for more-reducing reaction conditions. 

Although the CO conversion increased from 
an initial value of 14% at H./CO = 1 to a final 
value of 81% at H2/CO = 12, the product spe- 
cies exhibited a more-differentiated picture 
(Fig. 2B). Less-hydrogenated species, such as 
olefins, were formed preferentially at low H./CO 
ratios, as had been reported for promoted Co/Mn- 
oxide catalysts in the absence of long-term os- 
cillations (9, 10, 23). According to high-resolution 
electron microscopy studies, a Co-to-CoC phase 
transformation occurred under these hydrogen- 
deficient conditions (9, 10). Olefin formation, 
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especially for ethylene, decreased at increasing 
H,/CO ratios (Fig. 2B). The formation of meth- 
ane, the most hydrogenated product, became 
dominant at higher H,/CO ratios and reached 
~90% selectivity at H./CO = 12. Although some 
chain lengthening to C,,. paraffins was retained, 
early chain termination was favored under these 
strongly reducing conditions. 

Figure 2C illustrates that the oscillatory 
behavior described so far only occurred within 
a limited range of temperatures. At 200°C, 
regular oscillations were observed over a wide 
range of H,/CO pressure ratios. Similar to 
observations made for a reaction temperature 
of 220°C (as mentioned earlier), oscillation 
periods decreased with increasing H./CO ratio. 
At 260°C, oscillations with fading amplitude 
were observed for H,/CO = 1. The dissipation 
was then followed by the extinction of any 
oscillatory behavior at higher H,/CO ratios. 
We note that Co/Ce-oxide catalysts with Co/Ce 
atomic compositions that were appreciably dif- 
ferent from 2 (Co/Ce = 0.5, for example; see fig. 
S4) did not exhibit oscillatory behavior (fig. S4). 

The phase-space trajectories of the methane 
outlet flow versus temperature are shown in 
Fig. 3. For each H./CO ratio, the trajectory 
formed a cycle with the orientation expected 
for a mechanism of thermokinetic instability 
(35, 36). In this mechanism, a thermokinetic 
feedback caused by the high exothermicity of 
the FT reaction produced an increase of the 
temperature and an acceleration in the con- 
sumption of the adsorbed reactants, which 
then slowed the reaction and closed the cycle. In 
this way, the thermokinetic instability may trig- 
ger self-sustained oscillations (37), as observed 
in our research and described above. 


Theoretical analysis 


A deeper understanding of the intrinsically 
time-dependent reaction kinetics described thus 
far required a theoretical analysis of the reaction 
mechanism, the details of which are presented 
in the supplementary materials. A reduced 
scheme that summarizes the essential aspects 
of this mechanism is shown in Fig. 4. In par- 
ticular, we assume hydrogen-assisted dissocia- 
tion of CO (J5, 38) to provide O-H surface 
hydroxyl and partially hydrogenated carbon spe- 
cies, followed by hydrogenation of the latter to 
produce methane. Importantly, the O-H bond 
of OH undergoes an insertion reaction with CO 
to form surface formate-type species as a key 
surface reaction intermediate. A similar reaction 
scenario has also been envisaged for the meth- 
anol synthesis (39-41) over Cu-based catalysts. 
However, potential variations in the denticity of 
surface-bound formate, which could lead to 
different reactivities (42, 43), have not been 
considered in our reaction scheme. The occur- 
rence of CO insertion in the atmospheric FT 
reaction has been previously demonstrated in 
chemical transient studies using Co/MgO (14), 
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Fig. 5. Experimental and theoretical flow rates under conditions of self-sustained nonisothermal 
oscillations and forced temperature oscillations. (A) Time evolution of temperature and outlet flows 
under the same experimental conditions as in Fig. 2. (B) The same variables simulated with the theoretical 
kinetic model described in the supplementary materials for forced temperature oscillations. The outlet flow of 
CO is out of phase with temperature and products for the theoretical model in (B) as well as for the 


experimental observations in (A). 


Co/Mn-oxide (16), and other catalyst compo- 
sitions. The respective kinetic evidence for 
Co/Ce-oxide is also provided in the supplemen- 
tary materials. 

Chain lengthening involves surface hydro- 
genation of formate followed by CO insertion 
into the nascent surface O-R bond, where R is 
an alkyl group. Repetitive partial hydrogenation 
of surface carboxylate and CO insertion into 
the respective alkoxy O-R bond then provides 
chain lengthening. The reaction scheme also 
considers CO, formation from surface formate 
and may be completed to allow for olefin for- 
mation and hydrocarbon functionalization. 

An essential feature of our reaction mecha- 
nism is the relatively late water rejection from 
the adsorbed layer. This result is in full agree- 
ment with results obtained from chemical 
transient kinetics for various catalyst systems 
(14, 16, 44). In the case of Co/Ce-oxide, fig. S2 
shows delay times for water formation of sev- 
eral hundred seconds as compared with several 
seconds for methane formation. Reversible 
Ce**-to-Ce** reduction has not been considered 
in our reaction scheme nor has a Co-to-Co,C 
transformation been accounted for. Including 
such structural changes in a complex kinetic 
reaction scheme like the FT reaction is chal- 
lenging. Furthermore, no experimental evidence 
to support the occurrence of these transfor- 
mations under atmospheric reaction conditions, 
as applied in our study, has been obtained (for 
details, see the supplementary materials). 

Figure 5 compares the results of forced 
temperature oscillations with self-sustained 
nonisothermal oscillations as observed experi- 
mentally. The temperature dependence was 
assumed to arise mainly from Arrhenius fac- 
tors associated with C-O bond breaking. The 
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level of agreement between the two is deemed 
to be excellent. Time-averaged outlet flows of 
reactants and products deviate by about 20% 
(and usually much less) from each other over 
the entire range of H,/CO ratios. We note that 
this level of agreement is strongly dependent 
on the values of the kinetic constants that are 
used (see supplementary materials). Earlier 
observations of large-amplitude (100° to 200°C) 
and very-long-period (usually 60 to 80 min) 
oscillations in the FT reaction at higher pre- 
ssures (10 to 22 bar) did not provide deep 
mechanistic understanding (34, 45). In addi- 
tion to heat- and mass-transfer limitations, 
reversible deactivation through carbon depo- 
sition (34, 46) and oxidation-reduction cycles 
were suggested to provide a chemical feed- 
back for oscillations in temperature and either 
in-phase CO (45) or out-of-phase CO conver- 
sion (34). 


Discussion 


We report the discovery of rate-and-selectivity 
oscillations in the FT reaction over cobalt/ceria 
catalysts. The observations are modeled with a 
kinetic scheme based on the CO insertion 
mechanism, the thermal activation of C-O bond 
breaking, and periodic temperature forcing. 
Previous kinetic modeling studies that used the 
concept of forced oscillations to replicate self- 
sustained oscillatory behavior have generally 
been on much-less-complicated reaction net- 
works. Ultimately, it would be of interest to 
generate self-sustained oscillatory behavior on 
the basis of a thermokinetic feedback mecha- 
nism without forcing the temperature to oscillate. 

From an experimental point of view, the use 
of a nearly gradient-free microreactor turned 
out to be advantageous for our mean-field 
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kinetic modeling. Although heat transport 
through the gas phase ensures “global” syn- 
chronization over the catalyst bed, local cover- 
ages at the surface of individual particles may 
vary greatly, thus causing surface diffusion 
and spatiotemporal self-organization. Such phe- 
nomena have been studied in great detail under 
isothermal and ultrahigh vacuum conditions 
with oriented single crystals (47) but escape 
detectability in ensemble-averaging analyses 
with support catalysts at ambient pressures. 
With regard to the FT reaction, kinetic studies 
under molecular flow conditions in ultrahigh 
vacuum are incongruous because of the strong 
pressure dependence related to the hydrogen 
reactant. 

We anticipate that our approach will spawn 
further research to determine the appropri- 
ate parameters for triggering and analyzing 
kinetic instabilities of catalytic reactions that 
are as complicated as the FT reaction. This 
discovery can provide insight into the reaction 
pathways that lead to paraffins and olefins 
over a wide range of H,/CO partial pressure 
ratios and benefit present FT research, up to 
the point of astrochemistry, where the re- 
action is being considered as a candidate for 
producing complex molecules from simpler 
ones (48-50). The results of our studies re- 
inforce the concept of a dynamic catalyst de- 
sign (57) to improve the performance in terms 
of rate and selectivity by using oscillatory re- 
action states. Concurrent theoretical analyses 
under thermokinetic conditions may guide 
the mechanistic understanding of reactions 
that are as complex as the FT synthesis. 
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More than 10,000 pre-Columbian earthworks are still 
hidden throughout Amazonia 


Vinicius Peripato et al. 


Indigenous societies are known to have occupied the Amazon basin for more than 12,000 years, 

but the scale of their influence on Amazonian forests remains uncertain. We report the discovery, using 
LIDAR (light detection and ranging) information from across the basin, of 24 previously undetected 
pre-Columbian earthworks beneath the forest canopy. Modeled distribution and abundance of large-scale 
archaeological sites across Amazonia suggest that between 10,272 and 23,648 sites remain to be 
discovered and that most will be found in the southwest. We also identified 53 domesticated tree species 
significantly associated with earthwork occurrence probability, likely suggesting past management 
practices. Closed-canopy forests across Amazonia are likely to contain thousands of undiscovered 
archaeological sites around which pre-Columbian societies actively modified forests, a discovery 

that opens opportunities for better understanding the magnitude of ancient human influence on 


Amazonia and its current state. 


uring the pre-Columbian era, Amazonia 

was home to dense and complex socie- 

ties throughout its vast forested area 

spanning 6.7 million km? (J). These an- 

cient Indigenous societies had profound 
knowledge of earthmoving, riverine dynamics, 
soil enrichment, and plant and animal ecology, 
which allowed them to create domesticated 
landscapes that were more productive for hu- 
mans (2-4). With earthmoving techniques, In- 
digenous peoples created a wide variety of 
earthworks (i.e., ring ditches, geoglyphs, ponds, 
and wells), mostly between 1500 and 500 years 
before present, with social, ceremonial, and 
defensive functions (5). Around these earth- 
works, they also managed hundreds of tree 
species, some of which show evidence of do- 
mestication (6-9), and effected long-lasting 
changes in forest composition (J0-13). The 
scale and intensity of that landscape trans- 
formation remain unknown, in part because 
there has never been a comprehensive in- 
ventory of pre-Columbian sites across the 
basin. 

Domesticated landscapes in Amazonia have 
mostly been discovered by means of evidence 
from on-the-ground surveys (5, 14). Earthworks 
can be detected by orbital optical satellites with 
very high spatial resolution (75), but that tech- 
nique is mostly suitable for deforested areas (16). 
Airborne light detection and ranging (LIDAR) 
data—a remote sensing technique that can map 
microtopography beneath the forest canopy— 
has substantially changed our understanding 
of the magnitude of pre-Columbian urbanism 
in Mesoamerica (17, 78) and South America 
(19). Over the past decade, the use of LIDAR 
data has revealed the complexity of Mayan 
civilization by indicating a regionally inte- 


All authors and affiliations are listed at the end of this Research 
Article. 


Peripato et al., Science 382, 103-109 (2023) 


grated urban-rural community network in 
Mesoamerica (17). More recently, LIDAR en- 
abled the detailed mapping of two monumen- 
tal pre-Columbian settlements in an intensively 
domesticated landscape hidden under forest in 
southwestern Amazonia (19). Although Meso- 
american archaeological sites feature very 
different types of structures—stone construction 
as opposed to the use of earth, as in Amazonia— 


e Earthworks (937) 


+ New discoveries (24) 
e Other archaeological sites 
C3Amazonian regions 


4 


LIDAR technology has substantially impri Che 

A A upe 
our spatial understanding of archaeolog--— 
sites in forested landscapes by enabling the 
visualization of ancient large-scale earthworks 
(78, 19) beneath the forest canopy. Because de- 
forestation in Amazonia has removed about 
17% of the natural vegetation cover to date 
(20), LIDAR has the potential to reveal many 
more discoveries in the remaining 83% of the 
basin that is opaque to other remote sensing 
approaches. 

Here, we report a large number of previous- 
ly undocumented pre-Columbian earthworks 
with geometrically patterned enclosures in an 
Amazon-wide LIDAR dataset covering 0.08% 
of the basin (27). We combine these newly dis- 
covered sites with a comprehensive dataset 
of existing archaeological sites (ring ditches, 
geoglyphs, ponds, and wells) to model areas 
likely to harbor as yet undetected earthworks 
hidden beneath remote forest landscapes. On 
the basis of our predictive model, we estimate 
the number of undocumented earthworks and 
identify domesticated tree species associated 
with earthwork presence. 


Archaeological discoveries beneath the canopy 

Scanning 5315 km? of LIDAR data originally 
obtained for estimating aboveground biomass 
throughout the Amazonian forest (22) revealed 


Fig. 1. Geographical distribution of known and newly discovered pre-Columbian geometric earth- 
works in Amazonia. (A) Map of previously reported and newly discovered earthworks (purple circles and 
yellow stars, respectively) reported in this study across six Amazonian regions: central Amazonia (CA), 
eastern Amazonia (EA), Guiana Shield (GS), northwestern Amazonia (NWA), southern Amazonia (SA), 
and southwestern Amazonia (SwA). (B) Newly discovered earthworks in SA. (C to F) Newly discovered 
earthworks in SwA. (G to I) Newly discovered earthworks in GS. (J and K) Newly discovered earthworks 


in CA. Scale bars, 100 m. 
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24 unreported earthworks in southern, south- 
western, central, and northern (the Guiana 
Shield) Amazonia (Fig. 1A) (22D. We detected a 
fortified village in southern Amazonia (Fig. 1B), 
defensive and ceremonial sites in southwestern 
Amazonia (Fig. 1, C to F), crowned mountains 
and megalithic structures in the Guiana Shield 
(Fig. 1, G to D, and riverine sites on floodplains 
in central Amazonia (Fig. 1, J and K). 

In southern Amazonia, we found an ancient 
plaza town located in the Upper Xingu Basin 
(Fig. 1B). This region is known to have sup- 
ported dense populations in the past, distrib- 
uted throughout plaza villages interconnected 
by road networks and surrounded by domes- 
ticated landscapes with a diverse array of ter- 
restrial and aquatic resources (JO, 23). It is also 
clear that the earthworks in this region extend 
beyond the sampled area of the 200-m-wide 
LIDAR transect, restraining their full iden- 
tification. The layout of these earthworks is 
similar to that of other fortified villages doc- 
umented in this region, which supports the idea 
that these structures were built before European 
contact (10, 15, 24). 

In southwestern Amazonia, we found a com- 
bination of rectangular and circular features, 
known as geoglyphs, without detectable in- 
terconnecting roads occurring on flat terrain 
close to water bodies (Fig. 1, C to F). Docu- 
mented defensive and ceremonial earthworks 
in this region were built around two millennia 
ago and are dispersed across the well-drained 
plateaus of the tributaries of the Purus and 
Madeira rivers (25). 

In the Guiana Shield, we detected a combi- 
nation of rectangular and circular features on 
plateaus near water bodies (Fig. 1, G to I). The 
region holds different types of earthworks with 
different usages: permanent settlements within 
crowned mountains in French Guiana (26) and 
ceremonial sites featuring megalithic struc- 
tures arranged in circular clusters found along 
the coast of Amapa, Brazil (27). 

In the floodplains of central Amazonia, a 
hotspot of pre-Columbian riverine settlements 
(3, 23, 28), we identified two other earthworks 
(Fig. 1, J and K). We considered these sites to 
be anthropogenic because of their straight 
edges, although the geometry of these sites is 
distinct from that of the earthworks found in 
upland forests. Constant sedimentary deposi- 
tion over the centuries, through periodic floods, 
may have buried smaller features, preserving 
only the observed structures, which elsewhere 
have been associated with pre-Columbian fish- 
eries management (29). 


Modeling basin-wide distribution of earthworks 


By extrapolating the density of earthworks ob- 
served in our LIDAR data (0.0062 earthworks/ 
km7”) to the extent of Amazonia (6.7 million km”), 
we calculated that >41,000 earthworks may 
occur throughout the forest. However, given 
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that our LIDAR data covered only 0.08% of the 
total area of Amazonia and that earth-building 
societies were not evenly distributed across the 
basin (15, 30), more-rigorous methods were 
needed to estimate how many other as yet un- 
documented pre-Columbian earthworks might 
occur and where. To answer these questions, 
we used newly developed Bayesian statistical 
techniques and an inhomogeneous Poisson 
process (IPP) model (37), with an intensity func- 
tion using intensity covariates and thinned by 
observability covariates (32). Recently, the use 
of other machine learning techniques such as 
random forests have become popular for spe- 
cies distribution models (SDMs). There is still 


some uncertainty about this use (33), and the 
implementation of random forests to IPPs is 
still not available, but it might be a welcome 
addition to the toolkit of SDM analysis. 

The aforementioned statistical analysis was 
based on the records of 937 known earthworks 
complemented by our discoveries (24 earth- 
works), with three bioclimatic, three edaphic, 
and three topographic variables as intensity 
covariates. More than 40 variables were con- 
sidered in the model (table S1), and the se- 
lected ones (nine variables) cover gradients of 
temperature, precipitation, soil structure and 
fertility, topography, water-table depth, and 
distance to water bodies (27). Observability 
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Fig. 2. Probability model of pre-Columbian earthworks across Amazonia. (A) Predicted probability of 
earthwork presence for 1-km? cells across six Amazonian regions using an inhomogeneous Poisson process 
predictive model: central Amazonia (CA), eastern Amazonia (EA), Guiana Shield (GS), northwestern Amazonia 
(NwA), southern Amazonia (SA), and southwestern Amazonia (SwA). Areas not modeled (NA) are greyed 
out. (B) Predictive probability function for the number of as yet undetected earthworks; the dark area 
under the curve represents the credibility interval (Cl) of the probabilities associated with each number. 
(C) Boxplot of the estimated relative contribution of each covariate; the yellow diamond indicates the mean 
value. SCC, soil cation concentration; TPI, terrain position index; HAND, height above the nearest drainage. 
(D) Individual predicted probability of earthwork presence against intensity covariates. For projected areas 
across each Amazonian region on different probability thresholds, see table S2, and for the IPP model 


on continuous values, see fig. S1. 
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Fig. 3. Significant relationships between the occurrence and abundance of domesticated tree species 
and the modeled distribution of earthworks in Amazonia. Point estimates and confidence intervals of 
species significantly associated with predicted probability of earthwork presence, with an overall significance level 


of 5%. Positive species are more likely to occur and be 
presence is high, whereas negative species are less like 


covariates were used to describe the dataset 
sample preference by indicating the most fa- 
vorable location for sample acquisition (32). 
The effect of sample selection bias was individ- 
ually weighted for each sample (27). 

Our model predicts the number of as yet 
undiscovered pre-Columbian structures at be- 
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abundant where predicted probability of earthwork 
ly to occur and be abundant there. 


tween 10,272 and 23,648, with 95% probabil- 
ity, giving an average of 16,187 sites (Fig. 2B). 
These estimates suggest that the earthworks 
already documented in the Amazon to date 
account for a mere 4 to 9% of the total, and 
that 91 to 96% of Amazonian earthworks re- 
main undiscovered. 
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This predictive model indicated that earth- 
works are likely concentrated in southwestern 
Amazonia (Fig. 2A) and corroborated previous 
studies that found this region to be a hotspot 
of earth-building societies (73, 15, 34). In addi- 
tion, nearly all the highest-probability cells (=25% 
predicted probability) occur in a 94,713-km? 
rectangle that overlays a substantial portion of 
the Brazilian state of Acre. Indeed, southwest- 
ern Amazonia contains the earliest plant cul- 
tivation and domestication (9, 35), the oldest 
anthropogenic soils (35), low-density urban- 
ism (19), and now a much higher density of 
earthworks. The underlying spatial data dis- 
tribution may offer valuable information about 
pre-Columbian practices before European con- 
tact (36). 

Our analysis also suggests that pre-Columbian 
societies engaged in earthwork construction 
in all other regions, covering a broader area 
than previously thought. However, earthworks 
are heterogeneously distributed across Ama- 
zonian regions. Almost 80% of the basin has 
a 0 to 1% predicted probability of earthwork 
presence for 1-km” cells. These low-probability 
areas are mostly located in northwestern, north- 
ern, and central Amazonia, whereas higher- 
probability areas (>25% predicted probability, 
covering 1.41% of the basin) are located in 
southwestern Amazonia. Earth-building soci- 
eties were very common in some parts of the 
basin, but they may not have occupied all of 
Amazonia (6, 15, 30, 37). Other types of domes- 
ticated landscapes, such as Amazonian dark 
earths, are widespread [see maps in (37-39)] 
in regions (e.g., central Amazonia) where the 
earthworks analyzed in our study (ring ditches, 
geoglyphs, ponds, and wells) are not commonly 
found. Given the diversity of pre-Columbian 
societies and their land-use practices over 
12,000 years of ancient Amazonian history, 
forests were likely modified at varying inten- 
sities by different Indigenous populations 
through time (7, 38). 

Forests modified by earth-building societies 
are more likely to occur in locations with high 
temperature and low precipitation during the 
wettest and driest quarters (Fig. 2, C and D). 
Areas with high soil content of clay and silt 
and high cation concentrations also show high 
probabilities of earthwork presence. In ad- 
dition, earthworks tend to be located on pla- 
teaus with deep water tables, yet close to water 
bodies. This combination of environmental 
conditions probably facilitated the construc- 
tion of earthworks by offering periods with 
less precipitation and higher temperature, 
and soils with a better texture for earthmoving. 
In addition, the presence of a drier season fa- 
cilitates burning, which could help remove the 
vegetation for building earth structures (72), 
while higher soil cation concentrations could 
attract settlements for the development of di- 
versified food production systems with plants 
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managed and domesticated to different de- 
grees (15, 30). 

As expected, observability covariates indi- 
cate that previously reported earthworks are 
mostly found near roads, which facilitate 
field research (Fig. 2C). Tree cover, however, 
has no effect on the current distribution of 
earthworks. Thus, new earthworks can still be 
found even in deforested areas. The use of con- 
ventional very-high-resolution remote sensing 
data, guided by the probability surfaces pro- 
duced here (Fig. 2A), is likely to reveal more 
previously undetected earthworks in both 
closed-canopy and deforested areas of Ama- 
zonia. In addition, the rise of machine learn- 
ing techniques applied to archaeological site 
detection may lead to rapid discovery of new 
sites across deforested areas (40, 41). 

In forested areas, LIDAR surveys guided by 
our discoveries (e.g., full coverage of the Fig. 1B 
site) and the probability surfaces in Fig. 2A are 
promising tools for discovering new sites. 
However, very-high-probability areas (250% 
predicted probability) cover 32,120 km?, for 
which a complete LIDAR survey would require 
six times more data than have been collected 
to date in the Amazon. Thus, other approaches, 
such as mapping the distribution and abun- 
dance of domesticated species associated with 
earthwork presence, may help locate new sites 
within the Amazonian forest (42, 43). 


Relationships with domesticated species 


We analyzed the relationship between the re- 
sponse (occurrence and abundance) of 79 do- 
mesticated tree species identified across 1676 
forest plots (6) and the predicted probability 
of earthwork presence using generalized linear 
models to test whether forests with a higher 
probability of earthwork presence have a higher 
frequency and abundance of domesticated 
species (27). The occurrence and/or abundance 
of 35 domesticated species increased with the 
predicted probability of earthwork presence, 
while those of 18 species decreased. In total, 
the occurrence and/or abundance of 53 of the 
79 domesticated species showed significant 
association with the predictive model of earth- 
work distribution (Fig. 3). 

The species whose responses increased the 
most significantly along with the probability of 
earthwork occurrence are Bertholletia excelsa 
(P < 0.001, B = 1.13), Hevea brasiliensis (P < 
0.001, B = 0.65), and Brosimum alicastrum (P < 
0.001, B = 1.36), on the basis of occurrence data, 
and Astrocaryum murumuru (P < 0.001, B = 
0.71), Attalea phalerata (P < 0.001, 8 = 1.42), and 
Theobroma cacao (P < 0.001, 8 = 1.48), on the 
basis of abundance data (fig. S2). The species 
whose responses decreased the most signifi- 
cantly are Erisma japura (P < 0.001, B = -1.94), 
on the basis of occurrence data, and EF. japura 
(P < 0.001, B = -1.7) and Oenocarpus bataua (P < 
0.001, 8 = -0.27), on the basis of abundance data 
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(fig. S2). Although these highlighted species have 
multiple uses (44), they have mainly been used 
for their edible fruits and nuts in Amazonia, 
with the exception of H. brasiliensis, which 
has been used intensively for latex produc- 
tion (data S1). Species that are more frequent 
and abundant in forests with higher probability 
of earthwork occurrence were probably fa- 
vored by a combination of interacting past 
Indigenous management practices and eco- 
logical processes (6). These results confirm 
previous archaeobotanical and ethnobotanical 
data that have already shown that some spe- 
cies (e.g., B. excelsa, Astrocaryum spp., and 
Attalea spp.) are more abundant on and near 
archaeological sites across Amazonia (8, 14, 36). 
Species that are less frequent and abundant in 
areas with a higher probability of earthwork 
occurrence likely prefer habitats where earth- 
works are usually not found, such as sandy soils 
with lower fertility (7), or were disfavored by 
past practices that might have had detrimen- 
tal effects on some species (45). 


Social-ecological implications 


The massive extent of archaeological sites and 
widespread human-modified forests across 
Amazonia is critically important for establishing 
an accurate understanding of interactions be- 
tween human societies, Amazonian forests, and 
Earth’s climate (37). Considering the widespread 
extent of locations modified by pre-Columbian 
management and cultivation practices, Amazonia 
can be viewed as an ancient social-ecological sys- 
tem, with long-term responses to climate change 
(46), more similar to old secondary forests than 
pristine climax ecosystems (10). 

The discovery of earthworks hidden beneath 
dense forest canopies also indicates that, given 
sufficient time after these sites became depop- 
ulated, forests regenerated over the centuries. 
It is still unknown, however, the scale of struc- 
tural and floristic differences between pristine 
and domesticated forests across Amazonia. The 
forest reclaimed the land, but this is not the 
case for the Indigenous societies that managed 
these forests and waterbodies and that created 
these large structures. These archaeological leg- 
acies can play a role in present-day debates 
around Indigenous territorial rights. They serve 
as tangible proof of an ancestor’s occupation, 
way of life, and their relationship with the for- 
est. Today, Indigenous peoples struggle to re- 
cognize their right to land originally inhabited 
by their ancestors, along with the protection 
of their territories, languages, cultures, and heri- 
tages. In addition to protecting the native peo- 
ples that remain, the institution of Indigenous 
lands also collaborates with forest conservation 
in times of debates on climate change and the 
search for solutions that minimize impacts on 
the climate and promote carbon neutrality. 

These human-modified landscapes harbor 
an impressive archaeological heritage. Of the 
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24 earthworks newly reported in our study, 
50% are located in areas with some degree of 
legal protection. When all 937 known earth- 
works are considered, however, only 9% are 
located inside Indigenous lands and protected 
areas. To date, most pre-Columbian earth- 
works have been discovered after deforesta- 
tion. The highest density of known earthworks 
in Amazonia is, therefore, outside protected 
areas and mostly located in the region with 
the highest historical and current rates of de- 
forestation, called the “Arc of Deforestation.” 
Protected areas and Indigenous territories can 
act as barriers against illegal activities that 
promote the degradation and destruction of 
Amazonia’s natural and cultural heritage, but 
their implementation and expansion depend 
on strong government policies and law en- 
forcement (47, 48). 

Tronically, modern-day deforestation is re- 
moving the very evidence of pre-Columbian 
land-use strategies that were able to trans- 
form the landscape without causing large- 
scale deforestation (13). Today, Amazonia is 
experiencing expansion of agriculture and 
cattle ranching (49, 50), especially where earth- 
works are concentrated in the southern and 
southwestern regions, risking the destruction 
of earthworks and fracturing and hampering 
the identification of pre-Columbian occupation 
sites that provide direct evidence of ancient 
Indigenous territories. Our data on earthwork 
probability, suitable environmental conditions, 
and associated domesticated species should 
narrow the search for Indigenous heritage 
sites, enhanced by optical and LIDAR sensing 
to identify, monitor, and help conserve archae- 
ological features. Amazonian forests clearly 
merit protection not only for their ecological 
and environmental value but also for their 
high archaeological, social, and biocultural 
value, which can teach modern society how 
to sustainably manage its natural resources. 
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Structure of a ribonucleotide reductase R2 


protein radical 


Hugo Lebrette“**}, Vivek Srinivas‘, Juliane John’, Oskar Aurelius’, Rohit Kumar’, Daniel Lundin’, 
Aaron S. Brewster*, Asmit Bhowmick’, Abhishek Sirohiwal’, In-Sik Kim*, Sheraz Gul‘, Cindy Pham‘, 

Kyle D. Sutherlin’, Philipp Simon‘, Agata Butryn®-°t, Pierre Aller®®, Allen M. Orville™®, Franklin D. Fuller’, 
Roberto Alonso-Mori’, Alexander Batyuk’, Nicholas K. Sauter’, Vittal K. Yachandra‘*, Junko Yano*, 

Ville R. |. Kaila’, Britt-Marie Sjoberg’, Jan Kern**, Katarina Roos®§, Martin Hégbom"* 


Aerobic ribonucleotide reductases (RNRs) initiate synthesis of DNA building blocks by generating a free 
radical within the R2 subunit; the radical is subsequently shuttled to the catalytic R1 subunit through 
proton-coupled electron transfer (PCET). We present a high-resolution room temperature structure of 
the class le R2 protein radical captured by x-ray free electron laser serial femtosecond crystallography. 
The structure reveals conformational reorganization to shield the radical and connect it to the 
translocation path, with structural changes propagating to the surface where the protein interacts with 
the catalytic R1 subunit. Restructuring of the hydrogen bond network, including a notably short O-O 
interaction of 2.41 angstroms, likely tunes and gates the radical during PCET. These structural results help 
explain radical handling and mobilization in RNR and have general implications for radical transfer in proteins. 


ncontrolled or unmitigated free radicals 

can cause damage to cells but radicals 

are also essential to numerous meta- 

bolic pathways and enzyme-mediated 

chemistry (/, 2). Ribonucleotide reduc- 
tase (RNR) is an archetypal radical enzyme, 
and the tyrosyl radical (Ye) in the R2 subunit 
from Escherichia coli was the first stable pro- 
tein radical to be observed, 50 years ago (3). 
RNR provides the only pathway for de novo 
synthesis of deoxyribonucleotides and repre- 
sents a drug target for both cancer and infec- 
tious diseases (4, 5). Aerobic RNR (class I) 
depends on the ferritin-like R2 subunit to gen- 
erate a catalytic radical in an oxygen-dependent 
reaction; the radical must be transferred back 
and forth with the catalytic R1 subunit, which 
performs the ribonucleotide reduction (6). Radi- 
cal translocation between the subunits proceeds 
through reversible long-range proton-coupled 
electron transfer (PCET) in a transient R1-R2 
complex (7). Radical transfer initiation in- 
volves redox-induced structural changes in R2 
(8, 9), conformational gating (10), short-range 
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proton transfer (77) coupled to long-distance 
electron transfer (72), and regulation by R1 (7). 
Recently, the structure of an R1-R2 holo- 
complex was determined by cryo-electron mi- 
croscopy (cryo-EM) (7) providing a picture of 
the long-range radical transfer pathway. How- 
ever, atomic resolution snapshots of the radi- 
cal state and the conformational gating taking 
place at the R2 active site remain unresolved. 

Most R2 proteins harbor a conserved tyro- 
sine residue oxidized to Ye by an oxygen- 
activated metal center in the catalytically active 
state. In a recently discovered active metal-free 
R2 subclass, denoted R2e, this tyrosine residue 
is post translationally meta-hydroxylated to a 
3,4-dihydroxyphenylalanine (DOPA) which serves 
as the radical-harboring residue (13, 14). Oxygen 
activation of R2e and metal-containing R2 pro- 
teins display analogous pathways consisting 
of a nonactivated state, a catalytically active 
radical state, and a radical-lost ground state 
(Fig. 1A). 

From an experimental point of view, R2e 
from Mesoplasma florum (MfR2) represents an 
attractive model to study active radical states 
in RNRs. Its radical state is metal-independent 
which simplifies sample preparation and ex- 
cludes partial occupancy or mismetalation of 
the metal site often encountered with metallo- 
enzymes in vitro (75). In addition, in absence 
of protein R1 its radical state is stable with no 
decay observed after more than six hours at 25°C 
(13). However, as with any radical state, it is 
expected to be highly sensitive to photoreduction 
as x-ray radiation damage generates free radicals 
that spread through protein crystals (16). This 
obstacle renders the structural characteriza- 
tion of a protein radical state using standard 
x-ray crystallography methods unfeasible (77, 18). 
Though many crystal structures of iron- and 
manganese-containing R2 proteins from dif- 
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ferent organisms have been obtained, | Che 
: ; ‘ upce 

describe either the nonactivated state,\.- 
radical-lost ground state (often referred to as 
the “met” state) or partially reduced states. Crys- 
tal structures of R2e have been solved from two 
different organisms (13, 14), likewise showing 
signs of x-ray-induced photoreduction. 

Discrepancies between R2 crystal structures 
and spectroscopic data of radical states have 
been observed, leading to contradictory theo- 
retical models regarding the conformation of Ye 
and its environment (8, 79-21). In the present 
study, by rapid protein production, microcrys- 
tallization and x-ray free-electron laser (XFEL) 
serial femtosecond crystallography, using the 
diffraction-before-destruction principle (22), 
we have determined the atomic structure of 
MfR2 in the active radical state at room tem- 
perature. Compared with the structure of the 
radical-lost ground state, also presented here, 
the radical state structure reveals a notably 
short hydrogen bond and a critical rearrange- 
ment of conserved residues upon acquisition of 
the radical. Based on these two distinct states, 
we propose a mechanism for structural recog- 
nition of the radical state and a model for 
redox-coupled conformational gating as a pro- 
logue to the radical transfer. This mechanism 
defines central aspects of the PCET process and 
may be conserved in aerobic RNRs. 


Structure of the radical-lost ground state of MfR2 


In the catalytically active radical state, the 
radical-harboring meta-hydroxylated tyrosyl 
(from here denoted pop4Y126¢) in MfR2 exhib- 
its a characteristic absorbance peak at 383 nm 
and colors the protein blue (73). The catalytic 
radical can be chemically quenched by hydro- 
xyurea, a known RNR radical scavenger used 
for decades as an antitumor drug (23) pro- 
posed to inactivate R2 through PCET (24). Im- 
portantly, hydroxyurea causes a reversible 
inactivation of MfR2 as the enzyme can recover 
activity upon reoxidation by NrdI (13). Thus, 
the protein is not permanently inactivated or 
damaged but resides in a radical-lost ground 
state (Fig. 1A). 

To ensure an accurate depiction of the 
radical-lost ground state of MfR2, we solved the 
crystal structure at 1.35 A resolution of the pro- 
tein chemically quenched by incubation with 
hydroxyurea before crystallization. This treat- 
ment abolished the 383-nm absorbance peak 
and rendered the protein colorless (13). The 
electron density map clearly shows the post 
translational modification in the meta posi- 
tion of popsY126 (Fig. 1B). The two oxygen- 
containing functional groups of pop4Y126 in 
the para and meta positions (denoted para-O 
and meta-O, respectively) form hydrogen bonds 
(H-bond) with D88. Notably, the meta-O is in- 
volved in an unusually short interaction (Fig. 
1B). Using END/RAPID error analysis (25), the 
O-O distance between pop4Y126 and D88 was 
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Fig. 1. Radical state and radical-lost ground state of MfR2. (A) Outline of 
the proposed activation pathway of metal-free R2e, including the nonactivated 
state, catalytically active radical state, and radical-lost ground state (also known 
as “met” in canonical R2). The two states determined in this work are indicated 
in green. The structure of R2e in the nonactivated state has been determined 
previously (13, 14). For clarity, chemical reactions are not strictly balanced. 

(B) Structure of MfR2 in the radical-lost ground state obtained after chemical 
quenching by hydroxyurea (monomer A is shown). (C) Structure of MfR2 in the 
radical state obtained from XFEL serial femtosecond crystallography showing 

a reorganization of the site compared with the ground state including a coupled 
movement of the popaY126-D88 dyad, the presence of a new water wl, and 
the inward conformation of K213. The short H-bond is highlighted in orange. 
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(D) Superimposition of the ground and radical states. The 2-A displacement of 
the DOPA para-O is marked in purple. Nitrogen and oxygen atoms are shown 
in blue and red, respectively. Carbons are shown in gray and cyan for the radical- 
lost ground state and radical state, respectively. Distance between atoms involved in 
H-bond interactions are in A. Simulated annealing composite Omit 2F,-F, electron 
density maps are shown in green and contoured at 2 o. The structural changes 
are further illustrated in movie S1. (E and F) Using quantum mechanical calculations 
on the XFEL structure, the short H-bond between pop4Y126 and D88 can be 
reproduced by a DOPA radical state with the radical mainly located on the para-O 
and the proton located on meta-O (E) (neutral DOPA radical state) or D&88 (F) 
(negatively charged DOPA radical state). For clarity, only a subset of residues 
included in the calculations is presented on the figure (see fig. S3 for full details). 


calculated to 2.43 + 0.04 A, a length which 
could correspond to a low-barrier or single- 
well H-bond (26, 27). In addition, pop4Y126 
does not interact with any water, nor with 
K213 whose e-ammonium group is facing 
away from popaY126 (Fig. 1B). Compared with 
previous R2e structures, differences and al- 
ternate conformations are observed through- 
out the structures, particularly variations in 
the conformation of nop4Y126, D88, and K213 
(fig. SIA). 


Structure of the radical state of MfR2 by 
serial femtosecond crystallography 


To circumvent photoreduction artifacts, we used 
XFEL serial femtosecond crystallography to 
determine the structure of the catalytically 
active radical state of MfR2 at atomic reso- 
lution. The active radical-harboring protein 
was batch crystallized to produce a suspension 
of blue microcrystals used to collect room tem- 
perature serial femtosecond diffraction data at 
an XFEL source. The resulting dataset produces 
a model of the protein at 1.5-A resolution, and 
the electron density map allows unambiguous 
interpretation (Fig. 1C). A short H-bond be- 
tween popsY126 meta-O and D88 is present 
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with a O-O distance calculated to 2.41 + 0.05 A 
using END/RAPID error analysis (25) (see 
Methods for details). A short H-bond is ob- 
served in both states of the protein and may 
play a special structural role as suggested in 
other cases (28), contributing to maintaining 
the integrity of the enzyme active site (29). 
The short H-bond may stabilize the interaction 
between popsY126 and D88 to ensure that no 
hydrogen is available to mediate a putative pro- 
ton transfer from D88 to the popsY126 para-O, 
which would annul productive PCET. We note 
that this H-bonding structure results in a sit- 
uation analogous to canonical R2 proteins in 
which a deprotonated aspartate is involved in 
metal coordination and not in proton transfer, 
thus forcing the latter to occur with a different 
nearby proton donor, suggested to be a metal- 
bound water molecule (77). Furthermore, it is 
tempting to speculate that this short H-bond is 
involved in redox tuning. By preventing the 
radical delocalization between the meta-O and 
para-O, the DOPA radical becomes electroni- 
cally more similar to a tyrosyl radical, rather than 
a DOPA-semiquinone radical, in agreement with 
previous characterization of M/R2 by electron 
paramagnetic resonance (EPR) (13). 
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A notable conformational shift takes place 
upon radical acquisition between pop4Y126 and 
D88: the dyad undergoes a coupled coplanar 
(but not coaxial) rotation of ~22° with an ad- 
ditional rotation of the aromatic ring of ~12° 
along the Cg-C, axis. It results in a 2-A dis- 
placement of the para-O carrying the main 
radical spin density away from D88 and a re- 
organization of the interaction pattern around 
popaY126 (Fig. 1D). The aspartate residue cor- 
responding to D88 in MfR2 is strictly con- 
served as the N-terminal metal-coordinating 
residue in Ye-harboring canonical R2 proteins 
and exhibits redox-induced conformational 
changes in metal-containing R2 proteins 
(8, 17, 30, 31). Furthermore, coupled movements 
of the conserved radical-harboring Y have been 
observed to be redox-induced in R2b from 
Bacillus anthracis (17). For R2a from E. coli 
(EcR2a), the conserved aspartate is proposed 
to form a H-bond with the reduced Y in the 
ground state. By contrast, single-crystal EPR 
experiments suggest that the Ye rotates away 
in the radical state, leading to a ~1-A displace- 
ment of the radical-harboring oxygen and 
breaking the connection with the aspartate 
(8). A displacement of the Ye could also be 
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hypothesized from discrepancies observed 
between crystal structures and spectroscopic 
data for R2 proteins from Bacillus anthracis 
(32), Salmonella typhimurium (33), Coryne- 
bacterium ammoniagenes (30) and mouse (19). 
This type of rearrangement upon acquisition 
of the radical is principally similar to what 
we observe in MfR2 structures and is less 
pronounced than movements proposed in 
other studies, which involve either translation 
of the main chain or larger Ye displacement by 
several A (20, 21, 24). 

In addition, a clearly defined water molecule 
(wl) mediates a new H-bond between pop4Y126 
and the e-ammonium group of K213, which 
displays a different orientation facing toward 
popaY126 (Fig. 1C). K213 adopts a single well- 
defined conformation different from previously 
published structures of active R2e determined 
using synchrotron radiation (13) (fig. SIB). The 
presence of a water molecule at a position sim- 
ilar to wl has been observed previously in other 
RNR systems (34, 35), and seems to be de- 
pendent on the redox state (17) (fig. S2). More- 
over, superimposition with structures of R2 
proteins shows that the new location of the 
K213 e-ammonium group corresponds to the 
position of another water molecule that is metal- 
coordinated in canonical R2 proteins (fig. $2). 
This water is proposed to transfer a proton to Ye 
in the conformational gate initiating the PCET 
(11, 36, 37). Residue K213 was recently suggested 
by density functional theory to be a proton 
donor for radical transfer in R2e (38). There- 
fore, it may represent the water-equivalent 
proton donor in the case of R2e, and thus its 
conformational change could effectuate a com- 
parable conformational gate. 

Comparing the structures of the defined rad- 
ical and radical-lost states determined here to 
previously solved structures of R2e proteins 
shows that no previous structure fully repre- 
sents either the radical state or the radical- 
lost state (fig. S1). Although the proteins in 
prior work may have originally crystallized in 

S K213 ot es 


Q91 


N 
= 


, 
Y popaY 126 
radical-lost ground state 


Fig. 2. Major conformational changes between the radical-lost ground 
state and the radical state. (A) Structure of MfR2 in the radical-lost ground 
state obtained after chemical-quenching by hydroxyurea. (B) Structure of MfR2 
in the radical state obtained from XFEL serial femtosecond data showing 
reorganization of the site compared with the radical-lost ground state, including 
conformational changes of Q91, L183, and F187. In the radical state, Q91 
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the “active form,” they appear to have suffered 
different degrees of x-ray-induced photo- 
reduction during synchrotron data collection. 


The XFEL structure can be reproduced in 
silico by a radical state 


To evaluate whether the crystal structure ob- 
tained by XFEL femtosecond crystallography 
theoretically corresponds to a radical state, 
calculations were performed on the crystal struc- 
ture active site. Based on quantum chemical 
geometry optimizations, the short interaction 
between popsY126 and D88s could be reproduced 
with the main radical character on the para-O. 
The proton could reside on either pop4Y126 or 
D88 as both states produced a short O-O dis- 
tance (Fig. 1, E and F, and fig. S3, A and B). The 
calculated energy difference of 3 kcal/mol be- 
tween the two states suggests that both states 
are accessible with slight favoring of the proton 
residing closer to pop4Y126¢. In addition, var- 
ious alternative pop4Y126 states were modeled 
by quantum mechanical calculations, none of 
them agreeing well with the experimental ob- 
servations. In particular, a longer hydrogen bond 
with D88 is observed when pop4Y126 is modeled 
as a neutral DOPA, a DOPA quinone, or with 
the radical located on the meta-O (fig. $3, C to 
E). Furthermore, the calculated spin population 
of popaY126¢ in the protein active site models 
revealed an asymmetric distribution closer to 
a meta-substituted Ye than the fully delocalized 
character of an ortho-semiquinone, based on 
comparisons with calculated distributions in 
smaller models. This is fully consistent with 
spectroscopic data of the radical (13, 14) and 
indicates a possible role for the short pop4Y126- 
D88 hydrogen bond to destabilize the other- 
wise potentially too stable semiquinone radical 
in the protein. 

Molecular dynamics simulation starting from 
the XFEL structure of the radical state but with 
induced loss of the radical in silico, showed 
popAY126 movement with the dynamics domi- 
nated by the position of the radical-lost ground 


radical state 
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state, forming two H-bonds to D88 (fig. $3, F 
to H), consistent with the crystal structure. 
Altogether, our calculations support that the 
structure obtained by XFEL corresponds to 
the catalytically active radical state of the MfR2 
protein and are in agreement with previous 
EPR and UV-vis spectroscopic results (13), 
showing that the spin density distributes sim- 
ilar as in a meta-substituted tyrosyl radical ra- 
ther than as in an ortho-semiquinone. 


Specific protein rearrangements upon 
radical acquisition 


The radical acquisition in MfR2 leads to two 
major protein rearrangements that are of par- 
ticular interest as they can be directly impli- 
cated in radical generation, stabilization, and 
transfer. The first major protein rearrangement 
takes place within the activating-oxidant path. 
The channel connecting the NrdI flavin cofactor 
to the R2b metal site (39, 40), proposed as the 
0," route for activation, seems to be con- 
served in R2e (/4). In the radical-lost ground 
state of MfR2, in place of the metal center 
present in canonical R2, a continuous chain of 
well-defined H-bonded water molecules cre- 
ates the link between the putative oxidant 
route and the pop4Y126-D88 dyad. By contrast, 
in the radical state of MfR2, this water network 
is disrupted by Q91 which undergoes a large 
sidechain flip toward D88 (Fig. 2 and movie 
S1). In the radical-lost ground state, Q91 is 
involved in a H-bond network conserved in the 
R2b subclass (Q70 in R2b from E. coli), which 
lines the channel for oxidant transport to the 
Mn", active site (40). Our data suggest that 
Q91 could play a key role as it obstructs the 
putative oxidant channel in the radical state, 
preventing radical quenching by further oxi- 
dant access to the active site. 

The second major protein rearrangement 
upon radical acquisition concerns the PCET 
route. In the immediate vicinity of the side chain 
of popaY126, residues L183 and F187 undergo 
large conformational changes in the radical 


superimposition 


displaces two water molecules, breaking the water H-bond network toward 
popaY 126. Structural movements of L183 and F187 leave space for 3 water 
molecules interacting with D88, pop4Y126, and K213. Simulated annealing 
composite Omit 2F,-F, electron density maps are shown in green and contoured 
at 1.5 o. (C) Superimposition of the structures of MfR2 in the radical state 
(cyan) and radical-lost ground state (gray). 
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state of MfR2, leaving space for the position- 
ing of three water molecules (including w1) 
and creating a water-mediated H-bond net- 
work between popsY126, D88, K213, and Q91 
(Fig. 2 and movie S1). Residues L183 and F187 
belong to helix aE which forms a distorted 
m-helix conserved across many ferritin super- 
family members. The m-helix conformation of 
helix aE is believed to play a functional role 
(41, 42), and is known to undergo redox-induced 
structural rearrangements (/7, 30, 34). Resi- 
dues L183 and F187 are conserved in all R2 
proteins, the position of L183 being either F 
or L (in 91 and 9% of sequences, respectively) 
and F187 being conserved in 99% of the se- 
quences. In none of the R2 crystal structures 
solved to date do these residues exhibit con- 
formations resembling those in the MfR2 rad- 
ical state (fig. S4). In the first solved structure 
of R2, it was noted that the radical-harboring 
tyrosyl oxygen was surrounded by a conserved 
hdrophobic pocket formed by residues F208, 
F212, and 1234 in EcR2a (equivalents to L183, 
F187, and 1209 in MfR2, respectively) (43). The 
major function of these residues was proposed 
through mutational studies to contribute to 
the tyrosyl radical stability by insulating the 
radical-harboring tyrosyl oxygen (44). Our ob- 
servation of hitherto unseen movements of 
these radical-shielding residues further impli- 


cates them in radical control and suggests their 
involvement in gating of the ribonucleotide 
reductase PCET mechanism. The specific local 
rearrangements at the radical site also trans- 
late to movements of the protein backbone and 
global structural changes of the protein scaffold 
protruding to the R1 interaction surface and the 
radical transfer path (movie S2). It has previ- 
ously been shown that the active R1-R2 complex 
exhibits tighter binding after radical initiation 
(45-47). We propose that these global structural 
changes observed in R2 provide a mechanism 
by which the R1-R2 binding properties can be 
modulated during the catalytic cycle. 


Model of conformational gating for radical 
transfer initiation 


In MfR2, the catalytically active radical state and 
the radical-lost ground state are interconvertible 
by quenching the radical and through NrdI- 
mediated reoxidation of pop4Y126 (Fig. 1A). 
Based on our structural data, we propose a 
model of the conformational gating orches- 
trated by R2 after radical acquisition, which is 
a prelude to the radical translocation to the R1 
subunit (Fig. 3). This model proceeds in three 
steps. First, the oxidation of pop4Y126 leads 
to a repulsion between its para-O and D88 
due to the removal of the H-bonding hydro- 
gen atom, resulting in the 2-A displacement 


C 


of the popsY126 para-O (Fig. 3A). Secondly, this 
triggers a cascade of structural changes to 
shield the radical and prepare its transfer: Q91 
blocks the access to further oxidant, L183 and 
F187 reshape the insulating pocket around the 
radical, and water wl connects the pop4Y126 
radical-carrying oxygen with the PCET route 
(Fig. 3B). This also leads to global structural 
rearrangements of protein R2, including the 
R2-R1 interaction surface and binding of 
protein R1 (Fig. 3C and movie S82). Thirdly, the 
formation of the R1-R2 complex results in the 
ordering of the full R2 C-terminal tail at 
the R1-R2 interface [as demonstrated in (7)], 
completing the electron transfer path and in- 
ducing the injection of an electron to reduce 
popaY126e, e.g., through the conserved W52 
and/or Y325 (corresponding to W48 and Y356 
in EcR2a), as previously suggested (72) (Fig. 3D). 
Coupled to this event, a proton transfer from 
K213 to pop4Y126¢ occurs by means of the water 
wl [as proposed in (38)] and initiates the long- 
range radical translocation. Figure 3 summarizes 
the proposed steps of conformational gating in 
R2e. The interaction with R1 is modeled based 
on a superposition of the cryo-EM structure of 
the R1-R2 holocomplex from E. coli (7). 

This conformational gating model for ini- 
tiating the radical translocation could be com- 
mon to class I RNR systems. In PCET, proton 


Fig. 3. Model of conformational gating and radical transfer during PCET 

in R2e. (A) Oxidation of the radical-lost ground state popaY126 (gray) displaces 
popaY126-D88 and introduces a water molecule (wl) to facilitate the proton 
transfer from the redirected amino group of K213 (cyan). (B) Concurrently, the 
cavity surrounding the generated pop4Y126 radical is reshaped, shown in 
surface representation (radical-lost state and radical state in gray and cyan, 
respectively). Primarily, Q91 flips away from Y163 to block access of any further 
oxidant from Nrdl through the R2-Nrdl channel and L183 and F187 flip to reshape 
the cavity and facilitate electron transfer from W52 to popaY126. (C) Structure 
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superposition of MfR2 in 
the R1 binding interface. 


the radical (cyan) and radical-lost (gray) states at 
Reshaping of the pockets surrounding the popaY126 


leads to conformational changes protruding to the R2-R1 interaction surface and 


binding to protein R1 co 


mpletes the PCET pathway. (D) PCET is initiated by a 


proton transfer (PT) from the amino group of K213 to popaY126 mediated by wl, 


and simultaneously an e 


ectron transfer from W52 to popaY126, thus trans- 


locating the radical toward the active site C394 of R1. R1 (represented in red) 


is modeled in complex w 


ith the radical-state MfR2 using the cryo-EM structure of 


the R1-R2 holocomplex from E£. coli (PDB ID: 6w4x) (7). 
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transfer occurs through H-bond networks and 
requires the proton donor and acceptor to be 
within a standard ~2.8-A H-bond distance (48). 
As a displacement of the radical-harboring 
Y upon radical acquisition is also observed by 
spectroscopy in other R2 proteins (8, 19, 30, 32, 33), 
an intermediary may be required between the 
metal-bound water and Ye. In MfR2 the ad- 
ditional water wl gained upon radical acquisi- 
tion represents the missing piece of the puzzle, 
connecting the radical to K213 (Fig. 3D). We 
note that an analogous binding site for water 
has been observed in other R2 proteins (fig. 
$2), and that the e-ammonium group of K213 
is located at the position of the metal-bound 
water in metal-containing R2s (fig. S2). There- 
fore, we speculate that, similarly as in R2e, a 
water in this position links the radical to the 
metal-bound water proposed to be the proton 
donor in canonical R2 proteins and gates the 
first proton transfer to initiate radical translo- 
cation to R1. 
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A healthier routine 


re you all right?” the student asked as she passed me in the hallway. I was in the final stretch 
of lab work before writing my dissertation. The student, who was 4 years behind me in the 


same Ph.D. program, went on to say, “I see people from your cohort working so much, putting 

in extra hours, even at night.” She was concerned about me, but afterward, I realized she also 

wondered whether the same was expected of her. She didn’t know I had quietly rethought my 

work routine midway through my Ph.D. In that moment I knew I had a responsibility to tell 
students who followed in my footsteps that there was another, more sustainable way to approach their 
work. It’s a message I wish I’d heard when I began graduate school. 


As a first-year Ph.D. student, I was 
lucky to have kind and understand- 
ing advisers. But I felt overwhelm- 
ing pressure to become a flawless 
student. I kept finding a million 
reasons why I was coming into 
graduate school at a disadvantage 
compared with my peers. As a stu- 
dent from Colombia, I had been 
educated at institutions my pro- 
fessors and colleagues had never 
heard of, I spoke with an accent 
that was hard to understand, I 
blushed every time I spoke in pub- 
lic, and I was older than most of 
the people in my cohort. I wanted 
to be the best possible represen- 
tation of an international Latina 
student. But I couldn’t shake the 
feeling that I was an impostor in 
my new academic setting. 

The lack of firm deadlines and 
clear expectations in my program added to my anxieties and 
left me without a clear path forward. So, I instead looked 
around to get an idea of how I could fit in and impress my col- 
leagues. I noticed that most of the people around me seemed 
to be working all the time. I began to spend many weekday 
nights and weekends in the lab, even when it wasn’t strictly 
required for my project. 

After I passed my qualifying exam at the end of the first 
year, though, my strategy backfired. I crashed from burnout. 
The fatigue took a toll on my productivity and motivation. 
During one of many mindless Sundays in the office, I real- 
ized I'd fallen into the trap of assuming my colleagues’ work 
habits were the standard for belonging and being worthy of 
respect—and it wasn’t sustainable. 

From then on, I stopped counting the number of work 
hours I was putting in, leaving space every day for exercising 
and doing activities that I enjoyed. My new approach brought 
guilt and fearful feelings at first, but I pushed through with 


“Td fallen into the trap of assuming 
my colleagues’ work habits 
were the standard for belonging.” 


the aid of meditation and journal- 
ing. Eventually, I realized nobody 
was keeping track of the amount 
of time I spent in the lab anyway 
and it wasn’t something I needed to 
worry about. 

I noticed that having more time 
for myself made me more rested 
and motivated when I went back 
to work. And as I developed a more 
positive mindset, I became aware 
of the unique contributions I could 
make to my research precisely be- 
cause of my distinctive background. 
My impostor syndrome began to 
dissolve, and my confidence grew. 

I continued quietly along this 
path for years, until that conversa- 
tion in the hallway. The student was 
making the same mistake I did at 
her stage, setting for herself the un- 
healthy standard of what the older 
students were doing. I realized then that I could be a positive 
role model for others, and I had to be more vocal about what 
had worked well for me. 

So, when I ran into her again, weeks later, I told her that 
even though I was feeling a lot of pressure in the final year 
of my Ph.D., I was putting my mental and physical health 
first and wasn’t working overtime. I also began to talk with 
other students about how quality sleep, physical activity, 
healthy eating, and reasonable work hours were a daily part 
of my routine. 

The last year of my Ph.D. was hard. But I did my best to 
stick to a healthy work-life balance, and I felt good knowing 
I was doing my part to set a good example for my peers and 
combat the workaholic culture around me. It’s an approach I 
plan to continue throughout my career. 


Daniela Osorio Rodriguez is a postdoc at the California Institute of 
Technology. Send your career story to SciCareerEditor@aaas.org. 
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Spinning Disk Microscope System 
CrestOptics has announced the 

launch of a new spinning disk confocal 
microscope system called CICERO. 
Focusing on essential components 

of epi-fluorescence and confocal 
microscopy, CICERO was first developed 
to expand the company's portfolio and 
make high-end imaging more accessible 
to the scientific community at large. The 
result is a compact and cost-effective 
imaging system, comprised of a spinning disk confocal instrument 
that can be easily integrated into existing workflows with maximum 
configuration flexibility to support a seamless and user-friendly 
transition. 

CrestOptics 

For info: +39 066-166-0508 

crestoptics.com/cicero/ 


Versatile Flow Cytometry Dye 

Life science and clinical diagnostic supplier Bio-Rad Laboratories, 
Inc. has announced the release of its latest StarBright Dye range, 
StarBright Red 670. The 29th in the StarBright product line, Red 

670 provides researchers with validated flow antibodies against 
key immunology targets, conjugated to proprietary fluorescent 
nanoparticles. Serving as an expansion of its current range of highly 
validated antibodies, Red 670 is conjugated to StarBright Blue 700 
and StarBright Violet 610 Dyes. Compared with former StarBright 
Dyes, Red 670 offers additional human and mouse targets. It also 
introduces dog, cow, and pig targets—a first for StarBright—a new 
range providing for greater freedom and flexibility. In flow cytometry, 
the additional targets work both with conventional panels and with 
full-spectrum multicolor panels, expanding research capabilities 
across veterinary immunology research. Like all StarBright Dyes, 
Red 670 needs no special buffers for compatibility with the Bio-Rad 
ZE5 Cell Analyzer and S3e Cell Sorter, most flow cytometers, and 
most experimental protocols. Minimal lot-to-lot variation ensures 
reproducible and consistent staining, and the dyes are resistant to 
photobleaching with no loss of signal in fixation. 

“Now totaling 29 dyes for the 355, 405, 488, 561, and 640 nm lasers, 
the Bio-Rad StarBright Dye range is currently the largest series of 
dyes for conventional and full spectrum flow cytometry,” says Dr. 
Mike Blundell, Flow Cytometry Product Manager at Bio-Rad. “By 
extending our existing portfolio of 35 antibody markers to include 
another 29 human, mouse, and now cow, dog, and pig targets, we 
can offer our customers greater flexibility and choice when designing 
multicolor flow cytometry panels, including for translational and 
immunological research in veterinary species.” 

Bio-Rad 

For info: +1 510-356-7909 

www.bio-rad.com/en-us/ 


Dual-Cap Microcentrifuge Tube 

Azenta Life Sciences has released Cap2, anew 0.2m! PCR 
microcentrifuge tube featuring a dual-cap design, to preserve 
the integrity of samples used in genetic testing. Acting as botha 
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collection and processing tube, Cap2 can minimize sample transfu is 
and improve efficiency by reducing wastage. Designed to ensure 
sample traceability in fields like pre-implantation genetic testing, 
each Cap2 tube has both a unique 2D datamatrix code anda 
human-readable numerical identifier to prevent sample tracking 
errors. Like most modern tubes, the new Cap2 was designed for 
broad compatibility. It works in conjunction with automated systems 
for code reading, capping, and sample management—from Azenta 
and all other industry-recognized manufacturers. Manufactured 
from high-quality virgin polypropylene in a Class 8 clean room, 

each batch of tubes is tested and certified as free from endotoxins, 
DNase/RNase, leachables, and extractables. 

Azenta 

For info: +1 888-229-3682 
www.azenta.com/products/0.2ml-dual-cap-sample-collection-pcr- 
tube 


Fixed-focus Radioactive Process Lens 

Resolve Optics has announced the availability of lenses used to 
monitor radioactive processes in nuclear power plants and nuclear 
waste reprocessing facilities. The fixed-focus lenses come in the 
standard range of 6mm, 9mm, and 25mm focal length. Designed 

to provide a focal length of 6mm or 9mm respectively, the Model 
214 and 286 radiation-tolerant lenses deliver high-quality images 
with minimal geometric distortion from 400-750nm. The lenses are 
designed specifically for use with a 2/3-inch image format radiation- 
resistant tube and CMOS cameras. Key features of Models 214 

and 286 include their wide field of view and variable iris control. 

For producing sharp images without a strong yellow tint, Resolve 
Optics developed the Model 313 25mm focal length fixed-focus lens. 
Although yellow-tinting has traditionally been among the limitations 
of using radiation-tolerant lenses on color CMOS cameras, the 
cerium-doped glass in the Model 313 produces high-quality natural 
color images. 

Resolve Optics 

For info: +44-1494-777100 
www.resolveoptics.com/radiation-resistant-lenses-2/ 


Super-Resolution Confocal Microscope System 

Nikon recently released the AX / AX R with NSPARC confocal 
microscope system, which combines Nikon's flagship AX / AX R 
confocal system with the Nikon SPatial ARray Confocal (NSPARC) 
detector to realize next-level sensitivity and super-resolution with a 
confocal instrument. The NSPARC is based on a single-pixel photon 
counter (SPPC) array detector that brings the power of image 
scanning microscopy (ISM), an innovative technique capable of 
providing 100 nm lateral (XY) resolution, to an otherwise traditional 
point-scanning confocal system architecture. Not only does the 
NSPARC detector support super-resolution imaging, but it has 
superior sensitivity compared to traditional confocal detectors 

due to its ultra-low noise profile. This allows users to make full and 
practical use of the increased resolving power, even in challenging 
experimental conditions. Combine NSPARC detection with high- 
speed resonant scanning for a robust live-cell compatible video-rate 
super-resolution imaging solution. 

Nikon Instruments 

For info: +1-631-547-8500 
www.microscope.healthcare.nikon.com/ax 
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